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Preface 


Set theory is a rich and beautiful subject whose fundamental concepts permeate 
virtually every branch of mathematics. Yet, most mathematics students receive 
only a cursory overview of the theory of sets in their lower division courses. Set 
theory is a subject that is sufficiently important and interesting to merit its own 
undergraduate course. The book is intended to offer the reader, or department, 
just such a course. 


This introductory textbook delivers a one-semester course in undergraduate 
set theory at the upper-division level. Sets, of course, are the central objects that 
shall be investigated. Since essentially every significant concept in modern 
mathematics can be defined in terms of sets, students who possess a fairly deep 
understanding of set theory will find a first course in abstract algebra or real 
analysis much less formidable and, perhaps, much more accessible. My first 
undergraduate course in set theory, for example, gave me a definite advantage 
over other students in my ensuing mathematics courses. 


In this book, the fundamental facts about abstract sets—relations, functions, 
natural numbers, order, cardinality, transfinite recursion, the axiom of choice, 
ordinal numbers, and cardinal numbers—are covered and developed within the 
framework of axiomatic set theory. Mathematicians have shown that virtually all 
mathematical concepts and results can be formalized within set theory. This has 
been recognized as one of the greatest achievements of modern mathematics, 
and, consequently, one can now say that “set theory is a unifying theory for 
mathematics.” 


The textbook is suitable for a broad range of readers, from undergraduate to 
graduate students, who desire a better understanding of the fundamental topics in 
set theory that may have been, or will be, overlooked in their other mathematics 
courses. I have made an effort to write clear and complete proofs throughout the 
text. 


Many modern books in undergraduate set theory are written for a reader who 
is well versed in mathematical argument and proof. My primary goal was to 
produce a book that would be accessible to a relatively unsophisticated reader. 
Nevertheless, I have composed completely rigorous proofs. In addition, these 
proofs favor detail over brevity. Another goal was to write a book that is focused 
on those topics that a student is likely to see in advanced courses (including 
graduate courses). Thus, the book is comparatively concise and can be covered 
in one semester. Most other undergraduate set theory texts cannot possibly be 
covered in a semester. 


On the Origins of Set Theory 


In the nineteenth century, mathematicians were investigating problems that 
concerned functions in real analysis. Some of these problems involved the 
representation of a function by a trigonometric series and the convergence of 
such series. The young Georg Cantor then proved a uniqueness theorem for 
trigonometric series. Cantor, in his proof, introduced a process for constructing 
collections of real numbers that involved an infinite iteration of the limit 
operation. Cantor’s novel proof led him to a deeper investigation of sets of real 
numbers and then to the concept of transfinite numbers. 


It has been said that set theory, as a subject in mathematics, was created on the 
day that Georg Cantor proved that the set of real numbers is uncountable, that is, 
there is no one-to-one correspondence between the real numbers and the natural 
numbers. In the two decades following this proof, Cantor developed a 
fascinating theory of ordinal and cardinal numbers. 


Cantor’s revolutionary research raised many difficult issues and paradoxes. As 
the axiomatic method had assumed an important role in mathematics, Ernst 
Zermelo developed an axiomatic system for set theory and published the first 
axiomatization of set theory in 1908. Zermelo’s axioms resolved the difficulties 
introduced by Cantor’s development of set theory. After receiving some 
proposed revisions from Abraham Fraenkel, in 1930 Zermelo presented his final 
axiomatization of set theory, now known as the Zermelo—Fraenkel axioms. 


Topics Covered 


The book presents the axioms of Zermelo—Fraenkel set theory and then uses 
these axioms to derive a variety of theorems conceming functions, relations, 
ordering, the natural numbers, and other core topics in mathematics. These 
axioms shall also be used to investigate infinite sets and to generalize the 
concepts of induction and recursion. 


The Zermelo—Fraenkel axioms are now generally accepted as the standard 
foundation for set theory and for mathematics. On the other hand, there are 
textbooks that introduce set theory using a naive point of view. As naive set 
theory! is known to be inconsistent, such a theory cannot seriously be offered as 
a foundation for mathematics or set theory. 


The basics of logic and elementary set theory are first discussed in Chapter 1. 
Since students, typically, easily grasp the topics covered in this chapter, it can be 
covered at a fairly quick pace. The chapter ends with a brief overview of the 
Zermelo—Fraenkel axioms. 


Chapter 2 examines the first six axioms of Zermelo—Fraenkel set theory and 
begins proving theorems, justified by the axioms, about sets. It is also shown that 
the set operations discussed in Chapter 1 can be derived from these six axioms. 


A relation is defined as a set of ordered pairs in Chapter 3, where equivalence 
relations and induced partitions are also discussed. Then a precise set-theoretic 
definition of a function in terms of ordered pairs is presented. The chapter ends 
with a section on order relations and a section on the concepts of congruence and 
preorder. 


In Chapter 4, after representing the natural numbers as sets, the fundamental 
principles of number theory (for example, proof by mathematical induction) are 
derived from a few very basic notions involving sets. 


Cantor’s early work on the “size of infinite sets” is the subject of Chapter 5. In 
particular, we explore the notion of a one-to-one correspondence between two 
sets and the concept of a countable set. 


Well-ordered sets and transfinite recursion are examined in Chapter 6. In 
Chapter 7, it is shown that the axiom of choice implies Zorn’s Lemma, the 
Ultrafilter Theorem, and the Well-Ordering Theorem. The theory of ordinals is 
carefully developed in Chapter 8, and the theory of cardinals is presented in 
Chapter 9. The last section of Chapter 9 discusses closed unbounded sets and 
stationary sets. 


How to Use the Book 


It is strongly recommended that the reader be acquainted with the basics of sets, 
functions, relations, logic, and mathematical induction. These topics are 
typically introduced in a “techniques of proof” course (for example, see [1]), and 
they will be more seriously discussed in this book. As the emphasis will be on 
theorems and their proofs, the reader should be comfortable reading and writing 
mathematical proofs. For example, one should know how to prove that a 
function is one-to-one and to prove that one set is a subset of another set. 


In the book, naturally, there is a progressive increase in complexity. The first 
five chapters cover the important topics that every mathematics undergraduate 
should know about the theory of sets, including the Recursion Theorem and the 
Schréder—Bernstein Theorem. The last four chapters present more challenging 
material, beginning in Chapter 6 with the principle of transfinite recursion. 
When applying definition by transfinite recursion there are two functional forms 
that may be used: a set function or a class function. In both of these cases, we 
prove that one can construct a new function by recursion. The two proofs are 
similar, except that the class form requires the replacement axiom. Moreover, the 
class version implies the set version. Students will likely find these technical 
proofs relatively difficult to follow or appreciate; however, they should be 
assured that such recursive definitions are valid and that one can read the rest of 
the book without possessing a deep understanding of these proofs. 


If time is short, certain topics can be overlooked. For example, the following 
sections can be skipped without loss of continuity: 


e 3.5 Congruence and Preorder 
e 4.2.1 The Peano Postulates 
e 7.2 Filters and Ultrafilters 


e 8.4 The Cumulative Hierarchy 
e 9.3 Closed Unbounded and Stationary Sets. 


Furthermore, the proofs presented in Chapter 7 of Zorn’s Lemma 7.1.1 and of 
the Well-Ordering Theorem 7.3.1 do not appeal to transfinite recursion on the 
ordinals; however, such proofs are carefully outlined in Exercises 14 and 15, 
respectively, beginning on page 193 of Chapter 8. Therefore, in Chapter 7, one 
could just highlight the proofs of 7.1.1 and 7.3.1 and then, after completing 
Section 8.2, assign these shorter “ordinal recursion” proofs as exercises. 


The symbol () marks the end of a solution, and the symbol (1 identifies the 
end of a proof. Exercises are given at the end of each section in a chapter. An 
exercise marked with an asterisk * is one that is cited, or referenced, elsewhere 
in the book. Suggestions are also provided for those exercises that a newcomer 
to upper-division mathematics may find more challenging. 
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1 
Introduction 


Georg Cantor devoted much of his mathematical career to the development of a 
new branch of mathematics, namely, Set Theory. Little did he know that his 
pioneering work would eventually lead to a unifying theory for mathematics. In 
his earlier work, Cantor took a set of real numbers P and then formed the 
derived set P’ of all limit points of P. After iterating this operation, Cantor 
obtained further derived sets P”, P’”,.... These derived sets enabled him to 
prove an important theorem on trigonometric series. This work led Cantor to 
investigate sets in a more general setting and to develop an abstract theory of 
sets that would dramatically change the course of mathematics. 


The basic concepts in set theory are now applied in virtually every branch of 
mathematics. Furthermore, set theory serves as the basis for the definition and 
explanation of the most fundamental mathematical concepts: functions, relations, 
algebraic structures, function spaces, etc. Thus, it is often said that set theory 
provides a foundation for mathematics. 


1.1 Elementary Set Theory 


The set concept is one that pervades all of mathematics. We shall not attempt to 
give a precise definition of a set; however, we will give an informal description 
of a set and identify some important properties of sets. 


A set is a collection of objects. The objects in such a collection are called the 
elements of the set. We write q@€ A to assert that a is an element, or a 
member, of the set A . We write a ¢ A when a is not an element of the set 4 . 
A set is merely the result of collecting objects of interest, and it is usually 
identified by enclosing its elements with braces (curly brackets). For example, 
the collection A = {3,7,11,7} is a set that contains the four elements 


3,7,1ll,q7.So7TE A,and8 ZA. 


Sets are exceedingly important in mathematics; in fact, most mathematical 
objects (e.g., numbers, functions) can be defined in terms of sets. When one first 
learns about sets, it appears that one can naively define a set to be any collection 
of objects. In Section 1.4, we will see that such a naive approach can create 
serious problems. 


Certain sets routinely appear in mathematics. In particular, the sets of natural 
numbers, integers, rational, and real numbers are regularly discussed. These sets 
are usually denoted by: 


N = {0,1,2,3,...} is the set of natural numbers. 
Z={...,—3,—-2,-1,0,1,2,3,...} is the set of integers. 
. @ is the set of rational numbers. Thus, 2 € Q . 

IR._ is the set of real numbers, and so 7 c R. 
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Basic Definitions of Set Theory 


In this section, we discuss the basic notation and concepts that are used in set 
theory. An object « may or may not belong to a given set A; that is, either 
x € A or x € A, but not both. 


Definition 1.1.1. The following set terminology is used extensively throughout 
mathematics: 


1. Let A and #8 be sets. We write 4 — 3 when both sets have exactly the 
same elements. 
2. For sets 4 and 9 wewrite A = B to mean that the set 4 is a subset of 


the set J}, that is, every element of 4 is also anelement of 3 . 


. We say that the set 4 is a proper subset of the set /3 , denotedby 4 C PB, 
when A C # and A sf BB, that is, when every element of A is an 


element of /3 but there is at least one element in /3 thatisnotin A . 
4. Wewrite @ for the empty set, or the null set. The set @_ has no elements. 
5. Two sets 4 and J? are disjoint if they have no elements in common. 


It follows that AC A and @ C A, for any set 4. To see why OCA, 
suppose that @ Z A. Then there exists an « € @ such that « ¢ A. As there is 


no x such that 2 € @, we arrive at a contradiction. Therefore, we must have 
that OC A. 


A Venn diagram is a configuration of geometric shapes, which is commonly 
used to depict a particular relationship that holds between two or more sets. In 
Figure 1.1(a), we present a Venn diagram that illustrates the subset relation. 
Figure 1.1(b) portrays two sets that are disjoint. 


(a) Venn diagram of A C B (b) Disjoint sets A and B 


Figure 1.1. Two set relationships A property is a statement that asserts 
something about one or more variables (for more detail, see Section 1.3). For 
example, the two statements “a is areal number” and “y € R and y ¢ N ” are 
clearly properties that assert something, respectively, about 2 and ¥. One way 
to construct a subset is called the method of separation. Let A bea set. Givena 
property P(a:) about the variable x , one can construct the set of objects « € A 
that satisfy the property P(x); viatnely? we can form the truth set 

{a € A: P(a)} . Thus, we can separate the elements in A that satisfy the 
property from those elements that do not satisfy the property. 


Problem 1. Evaluate each of the truth sets: 


1 A={zEN:3<2< I1]}. 
2 B={yeZ:y*=4}. 
3. C={zEN:z isa multiple of 3}. 


Solution. A = {4,5,6,7,8,9,10}, B= {2,—-2}, and C = {0,3,6,9,...} 
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An interval is a set consisting of all the real numbers that lie between two 
given real numbers a and #, where a < h: 


1. The open interval (a,b) is defined to be (a,b) = {zw ER: a<a<b}. 

2. The closed interval |a, b| is defined to be |a,b] = {a €CR:a<a<b}. 

3 The  left-closed _ interval (a, b) is defined to be 
a,b) ={zER:a<a<b}. 

4. The right-closed interval (a, | is defined to be (a, b|={wER:a<a<b} . 


For each real number a, we can also define intervals called rays or half-lines: 


. The interval (a, 00) is defined to be (a,o0) = {w ER: a<-a}. 

. The interval [a, 00) is defined to be [a,oo) = {a ER: a<a}. 

. The interval (—oo, a) is defined to be (—o0,a) = {w ER: ax<a}. 
. The interval (—oo, a] is defined to be (—oo, a] = {tw ER: a <a}. 
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The symbol co denotes “infinity” and this symbol does not represent a number. 
The notation oo is often used to represent an interval “without a right endpoint.” 
Similarly, the mathematical notation — oo is used to denote an interval “having 
no left endpoint.” 


Definition 1.1.2. Let 4 be a set. The power set of 4 , denoted by P(A), is the set 


whose elements are all of the subsets of 4 . That is, P(A) { min A} : 


Thus, X € P(A) if and only if X CA. If A is a finite set with n 
elements, then one can show that the set P(A) has 2” elements. The set 
A = {1,2,3} has three elements, and so P(A) has eight elements, namely, 


P(A) = {o, {1}, {2}, {3}, {1,2}, {1, 3}, {2, 3}, (1,2, 3}}- 


Set Operations 


For a pair of sets A and Bf, there are three fundamental operations that we can 
perform on these sets. The union operation unites, into one set, the elements that 
belong either to A or to #. The intersection operation forms the set of 
elements that belong both to A and to #). The difference between A and B 
(in that order) is the set of all elements that are in 4 and notin f. 


Definition 1.1.3. Given sets 4 and /3}, we can build new sets using the following set 
operations: 


@) AUB={x:2€ Aorx€ B} istheunionof A and B. 


) ANB {x -wtT€AandzreEe B} is the intersection of A and /}. 
A\ B={xa:x€Aand a ¢ B} is the set difference of A and B 


(also stated in English as 4 “minus” f} ). 


The set operations in Definition 1.1.3 are illustrated in Figures 1.2(a), 1.2(b), 
and 1.2(c). Shading is used to identify the result of a particular set operation. For 
example, in Figure 1.2(c) the shaded area represents the set A \ B. 


(a) Venn diagram of A | (b) Venn diagram of A 


one 8 > 


(c) Venn diagram of A\B 


Figure 1.2. Three set operations When the elements of sets 4 and #3 are 
clearly presented, then one can easily evaluate the operations of union, 
intersection, and difference. 


Example 2. Let A = {1,2,3,4,5,6} and B = {2,4,6,8, 10,12} . Then 


¢ AUB = {1,2,3,4,5,6,8, 10,12} and ANB = {2,4,6}. 


¢ A\ B={1,3,5} and B\ A= {8,10, 12}. 
° (A\ B)U(B\ A) = {1,3,5,8, 10, 12} . 
° (A\ B)N(B\ A) =a. 


Exercises 1.1 


Let A, B.and C besets. 


If a é A\B andq € A ,showthatq E€ B. 
Show that if A C B, then C\ BC Gi re 
Suppose A \ B CC. Show that A \ Gi 2. 
Suppose 4 C PB and A C C’.Showtha AC BNC. 
Suppose 4 2 BadBp AC — g. Show that ACB \ ae 
Show that 4 \ (B \ C) ~ (A x B) Lic, . 
Show that if A\ BCC and A Z C,thn ANB a Q. 
Let P(z) be the property 27 > 2. Are the assertions P(2) 5 P(—2), 
P(), P(-3) true or false? 
. Show that each of the following sets can be expressed as an interval: 
(@) (—3,2)N (1,3). 
(b+) (—3,4) U (0,00). 
() (—3,2) \ [1,3). 


1. 
2. 
3. 
4. 
5. 
6. 
7. 
8. 


10. Express the following sets as truth sets: 
() A= {1,4,9,16,25,...}. 
o) B={...,-15,—-10,—5,0,5,10,15,...}. 


11. Show that each of the following sets can be expressed as an interval: 
(a) {tf ER: 27-1 <3}. 
(b) {f €R:2>0 and (x—-1)? <1}. 


12. Evaluate the truth sets: 

(a) A={xtEN:0 <2? < 24}. 

(bt) B={y€Z: y evenly divides 12}. 
(c) C= {z EN: 4 evenly divides z}. 
(dd) D={yeER°:l<y’ <4}. 


Exercise Notes: For Exercise 6, « ¢ B \ C means thatz ¢ B orzveEC. 


1.2 Logical Notation 


Before introducing the fundamentals of set theory, it will be useful to identify 
some relevant aspects of language and logic. The importance of correct logical 
notation to set theory, and to mathematics, cannot be overstated. Formal logical 
notation has one important advantage: statements can be expressed much more 
concisely and much more precisely. Set theory often expresses many of its 
important concepts using logical notation. With this in mind, we now discuss the 
basics of logic. 


Propositions and Logical Connectives 


A proposition is a declarative sentence that is either true or false, but not both. 
When discussing the logic of propositional statements, we shall use symbols to 
represent these statements. Capital letters, for instance, P, ©, FR, are used to 
symbolize propositional statements and are called propositional components. 
Using the five logical connectives /A,V,7,—+,4+ together with the 
components, we can form new logical sentences called compound sentences. For 
example, 


P AQ (means “ P and Q ” and is called conjunction). 

P\/Q (means “P or Q ” and is called disjunction). 

—P (means “not P” and is called negation). 

P — Q (means “if P, then @ ” and is called a conditional). 

P <Q (means “ P if and only if Q ” and is called a biconditional). 


2 ee ee 


Using propositional components as building blocks and the logical connectives 


as mortar, we can construct more complex compound sentences, for example, 
(P A 7Q) Vv (7S — R). Parentheses ensure that our compound sentences will 
be clear and readable; however, we shall be using the following conventions: 


1. The outermost parentheses need not be explicitly written; that is, one can 
write 4 A B todenote (AA B). 


2. The negation symbol shall apply to as little as possible. We can therefore 
use -A A B to denote (AA)A B. 


Truth Tables 


The truth value of a compound sentence in propositional logic can be evaluated 
from the truth values of its components. The logical connectives \, \V, 7, -, 
and <+ yield the natural truth values given in Table 1.1, where 7’ means “true” 
and #’ means “false.” 


Table 1.1. Basic truth tables 


P Q PAQ PQ PVQ 

rT? re & ee 
-P & op _ 
rr F FT 1 FT 
F F- F F F F 


(3) Negation 
(1) Conjunction (2) Disjunction 


PQ P3Q PQ P#@Q 


‘a | is I i % 
T FF T FF 
Fr fT FT  F 
FOF 7 FOF i. 
(4) Conditional (5) Biconditional 


Table 1.1(1) has four rows (not including the header). The columns beneath 
P and Q list all the possible pairs of truth values that can be assigned to the 
components P and (@. For each such pair, the corresponding truth value for 


P AQ appears to the right. For example, consider the third pair of truth values 
in this table, # 7’. Thus, if the propositional components P and @ are 
assigned the respective truth values /’ and 7’, we see that the truth value of 
PAQ is F. 


Table 1.1(2) shows that if P and @ are assigned the respective truth values 
7’ and /’, then the truth value of P V Q is 7’. Moreover, when P and @ are 
assigned the truth values 7’ and 7’, then the truth value of P V Q is also 7’. In 
mathematics, the connective “or” has the same meaning as “and/or”; that is, 
P\ @ is true if and only if either P is true or Q is true, or both P and Q 
are true. Table 1.1(3) shows that the negation of a statement reverses the truth 
value of the statement. 


Table 1.1(4) states that when P and @ are assigned the respective truth 
values 7’ and /’, then the truth value of P ~ @ is Ff’; otherwise, it is 7’. In 
particular, when P is false, we shall say that P —@Q_ is vacuously true. 
Table 1.1(5) shows that P <+ Q is true when P and © are assigned the same 
truth value; when P and @ have different truth values, then the biconditional is 
false. 


Using the truth tables for the sentences PAQ, PVQ,=AP, P + Q, and 
P <Q, we will now discuss how to build truth tables for more complicated 
compound sentences. Given a compound sentence, we identify the “outside” 
connective to be the “last connective that one needs to evaluate.” Once the 
outside connective has been identified, one can break up the sentence into its 
“parts.” For example, in the compound sentence =P V (Q A P) we see that V 
is the outside connective with two parts —P and Q A P. 


Problem 1. Construct a truth table for the sentence =P + (QA P). 


Solution. The components P and © will each need a column in our truth table. 
Since there are two components, there are four possible combinations of truth 
values for P and ©. We will enter these combinations in the two left most 


columns in the same order as that in Table 1.1(1). The outside connective of the 
propositional sentence =P — (QA P) is —. We can break this sentence into 
the two parts =P and @ A P. So these parts will also need a column in our 
truth table. As we can break the sentences =P and (AFP only into 
components (namely, P and (), we obtain the following truth table: 


P Q-7P QAP 7AP> (QAP) 


rr 2 F r fi 
[rT iF F F T 
ra = F F 
Ye F F 
STEP # 1 1 2 3 i 


We will describe in steps how one obtains the truth values in the above table. 
STEP 1: Specify all of the truth values that can be assigned to the components. 
STEP 2: In each row, use the truth value assigned to the component P to obtain 
the corresponding truth value for =P, using Table 1.1(3). sTEP 3: In each row, 
use the truth values assigned to © and P to determine the corresponding truth 
value in the column under @ A P via Table 1.1(1). STEP 4: In each row, use the 
truth values in the columns under =P and @ A FP to evaluate the matching 
truth value for the final column under the sentence ~P — (Q A P), employing 
Table 1.1(4). © 


Tautologies and Contradictions 


After constructing a truth table for a compound sentence, suppose that every 
entry in the final column is true. The sentence is thus true no matter what truth 
values are assigned to its components. Such a sentence is called a tautology. 


Definition 1.2.1. A compound sentence is a tautology when its truth value is true regardless 


of the truth values of its components. 


So a compound sentence is a tautology if it is always true. One can clearly see 
from the following truth table that the sentence P\y—P is a tautology: 
P =P PvV->P 
a 7 
Ff rT T 


Definition 1.2.2. A compound sentence is a contradiction when its truth value is false 


regardless of the truth values of its components. 


Therefore, a compound sentence is a contradiction if it is always false. One 
can easily show that the sentence P A —P is a contradiction. 


Logical Equivalence 


A propositional sentence is either a compound sentence or just a component. 
The next definition describes when two propositional sentences are logically 
equivalent, that is, when they mean the same thing. Mathematicians frequently 
take advantage of logical equivalence to simplify their proofs, and we shall do 
the same in this book. In this section, we will use Greek letters (e.g., a, 3, , 
and y) ; see page xiii) to represent propositional sentences. 


Definition 1.2.3. Let q» and Y be propositional sentences. We will say that y and Y 


are logically equivalent, denoted by ¥ <> (, whenever the following holds: For every 


truth assignment applied to the components of ~ and YY, the resulting truth values of 1 


and are identical. 


Problem 2. Show that =>(P VQ) @ =PA7Q. 


Solution. After constructing truth tables for the two statements —=(P \V Q) and 
aP A AQ, we obtain the following: 


rg PV@Q xaAfPV@) PO ar =a -PFA-@ 
a ft F rr 2. F F 
a: ti F an F i i F 
POT T PF F oT T F F 
FF F T FF T T T 


As the final columns in the truth tables for —-(P VQ) and ~PA-7Q_ are 
identical, we can conclude from Definition 1.2.3 that they are logically 
equivalent. ©) 


Whenever and w are logically equivalent, we shall say that 7 <= yp isa 
logic law. We will now present two important logic laws that are often used in 
mathematical proofs. These laws were first identified by Augustus De Morgan. 


De Morgan’s Laws (DML) 


lL. APY QO) eP A-e. 
2. (PAQ)e7PV-Q. 


Let uy and ¥ be propositional sentences. If one can apply a truth assignment 
to the components of y» and ¥ such that the resulting truth values of w and ¥ 
disagree, then y» and ¥ are not logically equivalent. We will use this fact in our 
next problem, which shows that the placement of parentheses in a compound 
sentence is very important. Note: A regrouping can change the meaning of the 
sentence. 


Problem 3. Show that sentences P V(QA-P) and (P VQ) A-=P are not 


logically equivalent. 


Solution. We shall use the truth table 


P Q PV(QARP) (PVQ)A-=P 


: a r I 
ae - F 
FOOT T T 
F F F F 


Since their final columns are not identical, we conclude that the propositional 
sentences P V (Q A =P) and (P V Q) A-P are not equivalent. © 


Propositional Logic Laws 


If a propositional component appears in a logic law and each occurrence of this 
component is replaced with a specific propositional sentence, then the result is 
also. a logic law. Thus, in the above De Morgan’s’ Law 
aAPVQ)#-7PA-Q, 


if we replace P and Q , respectively, with propositional sentences yw and ¥ , 


then we obtain the logic law ap V 9) & mY Ang, 


which is also referred to as De Morgan’s Law. 


Listed below are some important laws of logic, where w, , and Xx 
represent any propositional sentences. These particular logic laws are frequently 
applied in mathematical proofs. They will also allow us to derive theorems 
concerning certain set operations. 


De Morgan’s Laws (DML) 


1. Ale Vy) & AW. 


2. APAY) &7WV a9. 


Commutative Laws 

1 PAP SPAY. 

2, WVPSOPVy. 

Associative Laws 

1 PV(PVx)eWVeE)Vx. 
2. WA(PAX) S (PAY)AX. 
Idempotent Laws 

1 pAyvsy. 


2, vUVvSw. 


Distributive Laws 


1. pA(ePVx) > (WAY) VA x). 
2. WV (PAX) > HV Y)A(PV x). 
3. (PVX)AYS (PAY) V(XAY). 
4. (PAX) VPS (EVY)A(XVY). 


Double Negation Law (DNL) 


1, mp & Ww. 


Tautology Law 


1. w A (a tautology) + uv. 


Contradiction Law 


1. ow V (a contradiction) + w. 


Conditional Laws (CL) 


1. (b> y) @ (WV). 
2. (wv + yp) S&S Aly Ar-y). 


Contrapositive Law 


1. (p79 9) & (YAY). 


Biconditional Law 


L POY) (9) A> 9). 


The Tautology Law and Contradiction Law can be easily illustrated. Observe 
that ¢ V -y is a tautology. From the Tautology Law we obtain the following 
logical equivalence: y A (ye V ~~) & wv. On the other hand, because y A =y 
is a contradiction, it follows that uw V (yA -w) = by the Contradiction 
Law. 


Let @ and § be two propositional sentences that are logically equivalent. 
Now, suppose that @ appears in a given propositional sentence ©. If we replace 
occurrences of @ in © with @, then the resulting new sentence will be 
logically equivalent to ©. To illustrate this substitution principle, suppose that 
we have the propositional sentence =@ Vv a and we also know that a. 
Then we can conclude that (~QVa)@=(-=Q\V 8). Now, using this 
substitution principle and the propositional logic laws, we will establish a new 
logic law without the use of truth tables. 


Problem 4. Show that (P > R)A(Q —- R)} (PV Q)- R, using logic 
laws. 


Solution. We first start with the more complicated side (P + R) A(Q — R) 
and derive the simpler side as follows: 


(P+ R)A(Q — R)S (AP V R)A (AQ V R)_ by Conditional Law (1) 


#(APA-7=Q)VR by Distribution Law (4) 
&iAi(PVQ)VR by De Morgan’s Law (1) 
&(PVQ)-“R by Conditional Law (1). 


Therefore, (P+ R)A(QSR)S(PVQ)SR. © 


Using a list of propositional components, say A, B,C, D,..., and the logical 
connectives /\, V,7,—+,4+, we can form a variety of propositional sentences. 
For example, 


(P+ R)A-7(Q 4 (SVT). 


The logical connectives are also used to tie together a variety of mathematical 
statements. A good understanding of these connectives and propositional logic 
will allow us to more easily understand and define set-theoretic concepts. The 
following problem and solution illustrate this observation. 


Problem 5. Let 4 and B be any two sets. Show that a ¢ A \ B is equivalent 
to the statement « ¢ A orx EB. 


Solution. We shall show that « ¢ A\ BS (a € AV awe B) as follows: 
r€A\Be-AreA\B by definition of ¢ 


& A(x € AAw € B) by definition of \ 


SxrEAVrxEB by De Morgan’s Law and Double Negation. 


Therefore, « ¢ A \ B is equivalent to the assertion (a G AVa EB). © 


| 


Exercises 1.2 


1. Using truth tables, show that =P => Q) — (P A AQ). 

2. Construct truth tables to show that 
(P 6 Q) 8 (P+ Q)A(Q => P). 

3. Using truth tables, show that P <> (>FP = (Q A AQ)). 

4. Using truth tables, show that (? V Q) ARS (P A R) V tr A Q). 
Show that [FP — Q) \ (P —+ R) ~~ P- (Q A R), using logic 
laws. 

6. Show that (P — R) V (Q — R) = (P AN Q) —+ FR, using logic 
laws. 

7. Using propositional logic laws, show that P+(Q- R)S(PAQ)+R ; 

8. Show that iF — Q) —+ R adP-s (Q — R) are not logically 


equivalent. 


1.3 Predicates and Quantifiers 


Variables, for instance, « and ¥Y, are used throughout mathematics to represent 
unspecified values. They are employed when we are interested in “properties” 
that may be true or false, depending on the values represented by the variables. 
A predicate is simply a statement that proclaims that certain variables satisfy a 
property. For example, “a is a number” is a predicate, and we can symbolize 
this predicate by N(a). Of course, the truth or falsity of the expression N (2) 
can be determined only when a value for x is given. For example, the 
expression (4), which means “4 is a number,” is clearly true. 


When our attention is to be focused on just the elements in a particular set, we 
shall then refer to that set as our universe of discourse. For example, if we were 
just talking about real numbers, then our universe of discourse would be the set 
of real numbers RR . Furthermore, every statement made in a specific universe of 
discourse applies to just the elements in that universe. 


Given a statement P(a), which says something about the variable x, we 
often want to assert that every element x in the universe of discourse satisfies 


P(a). Moreover, there will be times when we want to express the fact that at 
least one element x in the universe makes P(a) true. We will thus form 
sentences using the quantifiers \Y and 3. The quantifier \Y means “for all” and 
is called the universal quantifier. The quantifier 3 means “there exists,” and it 
is identified as the existential quantifier. For example, we can form the sentences 


1. VaP(a) [means “for all x, P(a)”]. 
2. xP (x) [means “there exists an « such that P(a)”]. 


Any statement of the form Va P(x) is called a universal statement. A statement 
having the form Sa P(2) is called an existential statement. Quantifiers offer us a 
valuable tool for clear thinking in mathematics, where many concepts begin with 
the expression “for every” or “there exists.” Of course, the truth or falsity of a 
quantified statement depends on the universe of discourse. 


Suppose that a variable x appears in an assertion P(a). In the two statements 
VaP(a) and SaP(x), we say that 2 is a bound variable because x is bound 
by a quantifier. In other words, when a variable in a statement is immediately 
used by a quantifier, then that variable is referred to as being a bound variable. If 
a variable in a statement is not bound by a quantifier, then we shall say that the 
variable is a free variable. When a variable is free, then substitution may take 
place, that is, one can replace a free variable with any particular value from the 
universe of discourse—perhaps | or ©. For example, the assertion 
Va(P(a) — a = y) has the one free variable y . Therefore, we can perform a 
substitution to obtain Va( P(a) — 2 = 2). In a given context, if all of the free 
variables in a statement are replaced with values, then one can determine the 
truth or falsity of the resulting statement. 


There are times in mathematics when one is required to prove that there is 
exactly one value that satisfies a property. There is another quantifier that is 
sometimes used, though not very often. It is called the uniqueness quantifier. 
This quantifier is written as S!aP(a), and it means that “there exists a unique 
x satisfying P(a).” This is in contrast with Sa P(a), which simply means that 
“at least one x satisfies P(a).” 


As already noted, the quantifier 3! is rarely used. One reason for this is that 
the assertion 4!aP(a) can be expressed in terms of the other quantifiers 3 and 
Y. In particular, the statement <AleP(ax) is equivalent to 
dz P(x) AVaVy([P(x) A P(y)| 4 2 = y). 


The above statement is equivalent to 4!a;P(a) because it means that “there is an 
x such that P(a) holds, and any individuals 2 and y that satisfy P(a) and 
P(y) must be the same individual.” 


In addition to the quantifiers \¥ and 3 , bounded set quantifiers are often used 
when one wants to restrict a quantifier to a specific set of values. For example, to 
state that every real number x satisfies a property P(2), we can simply write 
(Va € IR)P(a). Similarly, to say that some real number x satisfies P(a), we 
can write (Sz € R)P(x). 


Definition 1.3.1. (Bounded Set Quantifiers) For each set A , we shall write 
(Var € A)F ‘ (x) to mean that for every x in 4, P (x) is true. Similarly, we will 


write (sx Ee A)P(a) to signify that forsome x in A, P(x) is true. 


The assertion (Va € A)P(a) means that for every x, if « € A, then P(x) 
is true. Similarly, the statement (Sa € A)P(a) means that there is an 2 such 
that z € A and P(x) is true. Thus, we have the logical equivalences: 


1. (V2 € A)P(x) 6 Va(a € A P(ax)). 
2. (Sa € A)P(x) & Ja(a € AA P(z2)). 


Quantifier Negation Laws (QNL) 


We now introduce logic laws that involve the negation of a quantified assertion. 
Let P(a) be any predicate. The statement Ya P(a) means that “for every x, 
P(a) is true.” Thus, the assertion =VYaP(a) means that “it is not the case that 


every x makes P(x) true.” Therefore, -Vz P(x) means there is an x that does 
not make P(a) true, which can be expressed as Sx—P(a). This reasoning is 
reversible as we will now show. The assertion Sa—P(.2:) means that “there is an 
x that makes P(a) false.” Hence, P(a:) is not true for every 2; that is, 
—VeP(a). Therefore, =VaP(a) and dxz4P(a) are logically equivalent. 
Similar reasoning will show that ~JaP(a) and Va—P(2) are also equivalent. 


We now formally state these important logic laws that connect quantifiers with 
negation. 


Quantifier Negation Laws 1.3.2. For any predicate P(a), we have the logical 
equivalences: 


1, “~WeP(2)  Se-P(z). 
2, saeP (2) 4 VeaP(e}. 


The above reasoning used to justify the quantifier negation laws can also be 
used to verify two negation laws for bounded set quantifiers. Thus, given a set 
A and predicate P(a:), the following two logic laws show us how statements of 
the form (Va € A)P(a) and (3% € A)P(a) interact with negation. Notice that 
when you push the negation symbol through a bounded set quantifier, the 
quantifier changes and the negation symbol passes over “wz € A.” 


Bounded Quantifier Negation Laws 1.3.3. For every predicate P(a), we have 
the logical equivalences: 


1. A(Va € A)P(x) & (Ax € A)-AP(2). 
2. a(ar € A)P(x) > (Va € A)AP(z). 


Quantifier Interchange Laws (QIL) 


Adjacent quantifiers have the form dasy, VaVy, Vasy, and ArVy. In this 
section, we will see how to interpret statements that contain adjacent quantifiers. 


When a statement contains adjacent quantifiers, one should address the 
quantifiers, one at a time, in the order in which they are presented. 


Problem 1. Let the universe of discourse be a group of people and let L(2, y) 
mean “x likes y.” What do the following formulas mean? 


1. drdyL(a, y). 
2. Sysclta,y). 


Solution. Note that “xz likes Y” also means that “¥Y is liked by x.” We will 
now translate each of these formulas from “left to right” as follows: 


1. SarayL(a,y) means “there is a person x such that SyL(a, y),” that is, 
“there is a person «2 who likes some person y.” ‘Therefore, 
xy L(x, y) means that “someone likes someone.” 

2. dydaxL (a, y) states that “there is a person Y such that dal (2, y),” that 
is, “there is a person Y who is liked by some person x.” Thus, 
4dysaL(a,y) means that “someone is liked by someone.” 


Hence, the statements JasyL(a,y) and SydxL(ax,y) mean the same thing. 


© 


Problem 2. Let the universe be a group of people and L(x, y) mean “x likes Y 
.” What do the following formulas mean in English? 


1. VaVyL(2, y). 
2. VyVaL(a, y). 


Solution. We will work again from “left to right” as follows: 


1. VaVyL(ax,y) means “for every person xv, we have that VyL(a, y),” that is, 
“for every person 2, we have that a2 likes every person ¥Y.” Hence, 


VaVyL(x, y) means that “everyone likes everyone.” 

2. WyVaL(x,y) proclaims that “for each person ¥, we have that VxL (x, y),” 
that is, “for each person Y, we have that ¥ is liked by every person x .” Thus, 
VyVaL(x2,y) means “everyone is liked by everyone.” 


So the statements VaVyL(a,y) and VyVaL(a,y) mean the same thing. © 


Adjacent quantifiers of a different type are referred to as mixed quantifiers. 


Problem 3. Let the universe be a group of people and (a, y) mean “x likes y 
.” What do the following mixed quantifier formulas mean in English? 


1. VadyL(a, y). 
2. AyVarL (x,y). 


Solution. We will translate the formulas as follows: 


1. VadyL(a, y) asserts that “for every person x we have that Syl (a, y),” that 
is, “for every person x there is a person ¥y such that x likes ¥.” Thus, 
VasyL(a,y) means that “everyone likes someone.” 

2. dyVaL(ax, y) states that “there is a person Y such that VaL(a,y),” that is, 
“there is a person Y who is liked by every person x.” In other words, 
dyVxL(x,y) means “someone is liked by everyone.” 


We conclude that the mixed quantifier statements VasyL(a,y) and 
dyVaeL(x,y) are not logically equivalent, that is, they do not mean the same 


thing. © 


To clarify the conclusion obtained in our solution of Problem 3, consider the 
universe U = {a,b,c,d} consisting of just four individuals with names as 


given. For this universe, Figure 1.3 identifies a world where SyVaL(ax,y) is 


true, where we portray the property L({a,y) using the “arrow notation” 


x — » y- Figure 1.3 illustrates a world where there is an individual who is 


very popular because everyone likes this person; that is, “someone is liked by 
everyone.” 


a 


‘ likes ' 
d d 


Figure 1.3. A world where SyVaL (a, y) is true, since someone is 
liked by everyone. 


Figure 1.4 presents a slightly different world in which ValyL(a,y) is true. 
So, in this new world, “everyone likes someone.” 


Figure 1.4. A world where VadyL(a, y) is true, because everyone 
likes someone. 


Let us focus our attention on Figure 1.4. Clearly, the statement Vasyl (zx, y) 
is true in the world depicted in this figure. Moreover, notice that SyVaL(a, y) is 
actually false in this world. Thus, VaSyL(a,y) is true and SyVaL(x,y) is 
false in the world presented in Figure 1.4. We can now conclude that 
VasyL(ax,y) and SyVaL(a, y) do not mean the same thing. 


Our solution to Problem 1 shows that SadyL(a,y) and SysaL(a,y) both 
mean “someone likes someone.” This supports the true logical equivalence: 


dedyL(x,y) = aysrL (x, y). 


Similarly, Problem 2 confirms the true _ logical equivalence: 
VaVyL (x,y) = VyVrL (a, y). 


Therefore, interchanging adjacent quantifiers of the same kind does not change 
the meaning. Problem 3, however, verifies that the two statements VydzL(2, y) 
and SaVyL(x, y) are not logically equivalent. We conclude this discussion with 
a summary of the above observations: 


e Adjacent quantifiers of the same type are interchangeable. 
e Adjacent quantifiers of a different type may not be interchangeable. 


We offer another example, involving the real numbers, which shows that the 
interchange of mixed quantifiers can change the meaning of a statement. 


Example 4. Let the universe of discourse be RR , the set of real numbers. 


1. Vasy(a + y= 0) means that for every real number x there is a real 
number Y such that «+y=0. We see that the sentence 
Vasy(a + y = 0) is true. 

2. AdyVa(a + y= 0) states there is a y € R such that Va(a + y = 0). 
This is false. 


Quantifier Interchange Laws 1.3.4. For every predicate P(2,y), the 
following three statements are valid: 


1. SesyP(2,y) & SyszeP(2,y). 
2. VavyP(x,y) = VyVxP(z, y). 


3. SoeveuP(2,y) => VyseP(x, y). 


We will be using the arrow => as an abbreviation for the word “implies.” The 
conditional connective — shall be reserved for formal logical formulas. It 
should be noted that the implication in item 3 cannot, in general, be reversed. 


The quantifier interchange laws also hold for bounded set quantifiers; for 
example, we have that 


da(say € A)P(2z,y) & (Ay € A)ArP(2, y), 
da(Vy € A)P(2, y) > (Vy € A)ArP(a, y). 


Quantifier Distribution Laws (QDL) 


A quantifier can sometimes “distribute” over a particular logical connective. The 
quantifier distribution laws, given below, capture relationships that hold between 
a quantifier and the two logical connectives \Y and /. In particular, the 
existential quantifier distributes over disjunction (see 1.3.5(1)), and the universal 
quantifier distributes over conjunction (see 1.3.6(1)). The following quantifier 
distribution laws can be useful when proving certain set identities. 


Existential Quantifier Distribution Laws 1.3.5. For any predicates P(x) and 
Q(x) we have the following distribution laws: 


aeP(z2) V axQ(2) > da( P(x) V Q(z)). 

(Sr € A)P(x) V (Ar € A)Q(x) & (Ax € A)(P(z) V Q(a)). 
(Sr € A)P(x) V (Ar € B)P(x) & (Ar € AUB)P(z2). 

(ar € AN B)P(a) => (ax € A)P(x) A (Ax € B)P(2). 


eS 


If R is a statement that does not involve the variable x , then we have: 


5. RA JeQ(x) > Ja(RA Q(z)). 
6. RA (ar € A)Q(z) > (Ar € A)(RA Q(z)). 


Universal Quantifier Distribution Laws 1.3.6. For any predicates P(a) and 
(Q(x) we have the following equivalences: 


1. VaP(x) AV2Q(x) & Va(P(2) A Q(2)). 
2. (Wa € A)P(x) A (Va € A)Q(x) & (Wa € A)(P(x) A Q(2)). 
3. (Va € A)P(x) A (Va € B)P(x) © (Va € AU B)P(a2). 


If R is a statement that does not involve the variable x , then we have: 


4, RVV2Q(r) 6 V2(RV Q(2)). 
5. RV (V2 € A)Q(x) & (Vz € A)(RV Q(z)). 


1.4 A Formal Language for Set Theory 


Cantor employed an informal approach in his development of set theory. For 
example, Cantor regularly used the Comprehension Principle: The collection of 
all objects that share a property forms a set. Thus, given a property P(a:), the 
comprehension principle asserts that the collection {a : P(a)} is a set. Using 
this principle, one can construct the intersection of two sets A and # via the 
property “a € A and » € B”; namely, the intersection of A and # is the set 
{a :a€ Aand x € B} . Similarly, we can form the union of A and B tobe 
the set {a2 :a€ Aor a € B}. In addition, we obtain the power set of A, 
denoted by P(A), which is the set whose elements are all of the subsets of A ; 
that is, P(A) = {X : X C A}. The comprehension principle allowed Cantor 
to establish the existence of many important sets. Today Cantor’s approach to set 
theory is referred to as naive set theory. 


Cantor’s set theory soon became an indispensable tool for the development of 
new mathematics. For example, using fundamental set theoretic concepts, the 
mathematicians Emile Borel, René-Louis Baire, and Henri Lebesgue in the early 
1900s created modern measure theory and function theory. The work of these 
mathematicians (and others) demonstrated the great mathematical utility of set 
theory. 


Relying on Cantor’s naive set theory, mathematicians discovered and proved 
many significant theorems. Then a devastating contradiction was announced by 
Bertrand Russell. This contradiction is now called Russell’s paradox. Consider 
the property «€¢2a, where x is understood to represent a set. The 
comprehension principle would allow us to conclude that A= {w: a € 2} isa 
set. Therefore, 
the set A consists of all the sets x that satisfy x € x. (a) 


Clearly, either A € A or A ¢ A. Suppose A € A. Then, as noted in (A), A 
must satisfy the property A ¢ A, which is a contradiction. Suppose A ¢ A. 
Since A satisfies A ¢ A, we infer from (a) that A € A, which is also a 
contradiction. 


Russell’s paradox thus threatened the very foundations of mathematics and set 
theory. If one can deduce a contradiction from the comprehension principle, then 
one can derive anything; in particular, one can prove that | — 2. Cantor’s set 
theory is therefore inconsistent, and the validity of the very important work of 
Borel and Lebesgue then became questionable. It soon became clear that the 
comprehension principle needed to be restricted in some way and the following 
question needed to be addressed: How can one correctly construct a set? 


Ernst Zermelo resolved the problems discovered with the comprehension 
principle by producing a collection of axioms for set theory. Shortly afterward, 
Abraham Fraenkel amended Zermelo’s axioms to obtain the Zermelo—Fraenkel 
axioms that have now become the accepted formulation of Cantor’s ideas about 
the nature of sets. In particular, these axioms will allow us to construct a power 
set and to form the intersection and union of two sets. These axioms also offer a 
highly versatile tool for exploring deeper topics in mathematics, such as infinity 
and the nature of infinite sets. 


Before presenting the axioms of set theory, we must first describe a formal 
language for set theory. This formal language involves the logical connectives 
A, V,7,-—-, © together with the quantifier symbols Y and 3. In addition, 
this formal language uses the relation symbols = and € (also ¥ and ¢€ ). 


What is a formula in the language of set theory? An atomic formula is one 


~ 


that has the form x € y or & = 2, where “, Y, 2 can be replaced with any other 
variables, say, a,b, x’, y’, 2',@,, 2, ys, X, Y,.... We say that X is a formula 
(in the language of set theory) if \ is an atomic formula, or it can be constructed 
from atomic formulas by repeatedly applying the following recursive rule: If 
and w are formulas, then the next seven items are also formulas: 
Vay, dy, (-vy), (PAY), (PV), (Pov), Yor). 

Hence, Vy(a € y+ (a2 =aVa€p)) is a formula in the language of set 
theory because it can be constructed from the atomic formulas «7 € y, v= a, 


x €p and repeated applications of the above recursive rule. Figure 1.5 
illustrates this construction, where the statement « ¢ p is used to abbreviate 


(n= € p). 


Vy(x Ey (x =aVx¢€ p)) 
v| 
(xEy— (x=aVx¢ p)) 
xE€y (x=avx¢€ p) 


x=a x€p 


xe Pp 


Figure 1.5. Construction of the formula Vy(a € y — («& = aV x € p)) 


Formulas are viewed as “grammatically correct” statements in the language of 
set theory. Moreover, the expression Va((—>+ xy is not a formula because it 
cannot be constructed from the atomic formulas and the above recursive rule. In 
practice, we shall use parentheses so that our formulas are clear and readable. 


We will also be using, for any formulas Y and w), the following three 
conventions: 


1. The outermost parentheses need not be explicitly written; that is, one can 
write ~ Ay to denote (vy A wv). 
2. The negation symbol will apply to as little as possible. We can therefore 


use =p A y to denote (=~) Ay. 
3. Bounded set quantifiers shall be used. Thus, we can abbreviate the 
formula Va(a € y + x ¢ a) by the more readable (Vx € y)(a € a). 


We will also use symbols that are designed to make things easier to understand. 
For example, we may write A C BP rather than Va(a2 € A+ 2 € B). 


Throughout the book, we will be using the notation y(w,...,z) to identify 


w,...,2 as being free variables (see page 14) that appear in the formula ¥. If 
the variables w,...,2 are free, then substitution may take place. Thus, we can 
replace all occurrences of x, appearing in ¥, with a particular set %o and obtain 
Ai, 205 z). Moreover, a formula y(z,...,2) may contain parameters, that 


is, free variables other than 2,...,2 that represent unspecified (arbitrary) sets. 
Parameters denote “unassigned fixed sets.” For an example, let y(a:) be the 


formula VY(t € ¥ + (t@ = OSV AE p)). 


So, v(x) has x as an identified free variable, @ as a constant, and a parameter 
p (an unassigned set). To replace a parameter P ina formula y(a,...,2) with 
an specific set Po means that every occurrence of P, in ¥ , is replaced with Po. 


We will now explore the expressive power of this set theoretic language. For 
example, the formula 4a(a € y) asserts that the set y is nonempty. Moreover, 


—JyVa(a € y) States that “it is not the case that there is a set that contains all 


sets as elements.” In addition, one can translate statements in English, which 
concern sets, into the language of set theory. Consider the English sentence “the 
set x contains at least two elements.” This sentence can be translated into the 
language of set theory by Syaz((yExAzeEar)Ay#z). 


Let y(z) be a formula with free variable z and let a be a set. The sentence 
“there is a set 2 whose members are just those ¥ ’s that satisfy y € a and yy) 
,” is represented by the formula SaVy(y € « 6 (yEaAyly))). 


Let y(a) and w(a) be formulas. Now consider the relationship 


a, if v(x); 
y b, if w(x) and not v(x); 
@, if none of the above hold. 
(1.1) 


This relationship can be translated into the language of set theory by 


(p(x) Ay = a) V (W(x) A(2)) Ay = 8) V ((A0(a) A 79(z)) Ay = 9). 
(1.2) 


Let y(a,y) be the formula in (1.2). One can verify that y(a,y) holds if and 
only if (1.1) holds. Note that for all x there is a unique ¥ such that y(2, y). 


Exercises 1.4 


1 
2 
3 
4. 
5 
6 


. What does the formula ArVy( y € x) say in English? 

. What does the formula Vysae(y o x ) say in English? 

. What does the formula Vysa(a g y) say in English? 

What does the formula Vyrda (a 2 y) say in English? 

. What does the formula Vzdarsy (x EyAye z) say in English? 

. Let (a) be a formula. What does VzVy((y(x) A y(y)) 4 z= y) 
assert? 


7. Translate each of the following into the language of set theory. 


(a) & isthe unionof @ andy. 

(b) «x isnot asubset of ¥. 

(c) & is the intersection of @ and /). 

(d) @ and f, have no elements in common. 


8 Let a, h, C', and J) be sets. Show that the relationship 
a, ifteC\D; 
b, ifa@E€C\D, 


can be translated into the language of set theory. 


1.5 The Zermelo—Fraenkel Axioms 


The axiomatic approach to mathematics was pioneered by the Greeks well over 
2000 years ago. The Greek mathematician Euclid formally introduced, in the 
Elements, an axiomatic system for proving theorems in plane geometry. Ever 
since Euclid’s success, mathematicians have developed a variety of axiomatic 
systems. The axiomatic method has now been applied in virtually every branch 
of mathematics. In this book, we will show how the axiomatic method can be 
applied to prove theorems in set theory. 


We shall now present the Zermelo—Fraenkel axioms. Each of these axioms is 
first stated in English and then written in logical form. After the presentation, we 
will then discuss these axioms and some of their consequences; however, 
throughout the book we shall more carefully examine each of these axioms, 
beginning in Chapter 2. While reading these axioms, keep in mind that in set 
theory everything is a set, including the elements of a set. Also, recall that the 
notation @(a,...,2) means that %,...,2 are free variables in the formula @ 


and that @ is allowed to contain parameters (free variables other than “,---, 2 ). 


1. Extensionality Axiom. Two sets are equal if and only if they have the 
same elements. 
VAVB(A= Bo Va(xEAvzre B)). 
2. Empty Set Axiom. There is a set with no elements. 
AAVa(a ¢ A). 
3. Subset Axiom. Let y(a) be a formula. For every set A there is a set 
S that consists of all the elements x € A such that (x) holds. * 
VAASVa(x@ ES & (x EAA GY(a))). 


4. Pairing Axiom. For every u and wv there is a set that consists of just 
uandv. 


VuVud AVa(x2 € AG (a@ =UuVax=v)). 


5. Union Axiom. For every set F there exists a set ) that consists of all 


the elements that belong to at least one set in F . 
VFAUVa(x2 EU H AC(t@ ECAC EF)). 


6. Power Set Axiom. For every set A there is a set P that consists of all 
the sets that are subsets of A . 


VASPVa(xzEPeVvy(yeroyeA)). 


7. Infinity Axiom. There is a set J that contains the empty set as an 
element and whenever x € J, thenxU{x}eET. 


JI(DPETAVa(ex Elo axU{a}e]))). 


8. Replacement Axiom. Let u(x, y) be a formula. For every set A , if for 
each x € A there isa unique ¥ such that u(x, y), then there is a set 
S that consists of all the elements Y such that w(x,y) for some 


rEA. (See endnote 2.) 
VA((Va € A)Alyy(a, y) + ASVy(y € S 6 (Ax € A)u(z,y))). 


9. Regularity Axiom. Every nonempty set 4 has an element that is disjoint 
from A. 


VA(A #4 OG Ar(x €E AATNA=BD)). 


The extensionality axiom simply states that two sets are equal if and only if 
they have exactly the same elements (see Definition 1.1.1(1)). The empty set 
axiom asserts that there exists a set with no elements. Since the extensionality 
axiom implies that this set is unique, we let @ denote the empty set. 


The subset axiom proclaims that any definable subcollection of a set is itself a 
set. In other words, whenever we have a formula y(a) and a set A , we can then 
conclude that {a € A: y(a)} is a set. Clearly, the subset axiom is a restricted 
form of the comprehension principle, but it does not lead to the contradiction 
that we encountered in Russell’s paradox. The subset axiom, also called the 
axiom of separation (see page 3), is described as an axiom schema, because it 
yields infinitely many axioms—one for each formula ¥. Similarly, the 
replacement axiom is also referred to as an axiom schema. 


The pairing axiom states that for any two given sets, there is a set consisting 


of just those two sets. Therefore, for all sets uw and w, the set {u,v} exists. 
Since {u,u} = {wu}, it follows that the set {uw} also exists for each w. 


The union axiom asserts that for any set *, there is a set {) whose elements 
are precisely those elements that belong to at least one member of #. More 
specifically, the union axiom proclaims that the union of any set F exists; that 
is, there is a set (/) so that » € U if and only if « € A forsome A € F. As 
we will see, the set (J is denoted by |) F. 


The infinity axiom declares that there is a set / such that @ € J and 
whenever x € J, then x U{x} €/. Since @ € J, we thus conclude that 
@U{o@}={o}el. Now, as {O}eE]1, we also have _ that 
{Sa}U{{o}}= {ao {ollel. as in this manner, we see that the set 
i must contain all the following sets: 


@, {2}, {9, {O}}, {9 {O}, (9, (oy, — 


Observe, by the extensionality axiom, that @ # {@}. One can also show that 
any two of the sets in the above list are distinct. Therefore, the set / contains an 
infinite number of elements; that is, 7 is an infinite set. 


The replacement axiom plays a crucial role in modern set theory (see [8]). Let 
A be aset and let (2, y) be a formula. Suppose that for each x € A, there is 
a unique Y such that «(a,y). Thus, we shall say that Y is “uniquely 
connected” to x . The replacement axiom can now be interpreted as asserting the 
following: If for each a € A there is an element ¥ that is uniquely connected to 
v , then we can replace each x € A withits unique connection ¥ and the result 
forms a new set. In the words of Paul Halmos [7], “anything intelligent that one 
can do to the elements of a set yields a set.” 


Given any nonempty set A , the regularity axiom asserts the AMa= @ for 
some a € A. Can a set belong to itself? The regularity axiom rules out this 
possibility (see Exercise 3). 


The formulas in the subset and replacement axioms may contain parameters. 
We will soon be proving theorems about formulas that may possess parameters. 


Because parameters represent arbitrary sets, any axiom/theorem that concerns a 
generic formula with parameters is applicable whenever the parameters are 
replaced with identified sets. As a result, such an axiom/theorem can be applied 
when a formula contains fixed sets, as these sets can be viewed as ones that have 
replaced parameters. For example, the subset axiom concerns a generic formula 
(a). So this axiom can be applied when specific sets appear in y(2). 


This completes our preliminary examination of the set-theoretic axioms that 
were first introduced by Ernst Zermelo and Abraham Fraenkel; however, we will 
more fully examine each of these axioms in the remainder of the book. 
Furthermore, before we make our first appeal to a particular axiom, it shall be 
reintroduced prior to its initial application. In addition, we will not invoke an 
axiom before its time; that is, if we are able to prove a theorem without 
appealing to a specific axiom, then we shall do so. Accordingly, we will not be 
using the regularity axiom to prove a theorem until the last section of Chapter 8. 


It is a most remarkable fact that essentially all mathematical objects can be 
defined as sets. For example, the natural numbers and the real numbers can be 
constructed within set theory. Consequently, the theorems of mathematics can be 
viewed as statements about sets. These theorems can also be proven using the 
axioms of set theory. Thus, “mathematics can be embedded into set theory.” 


Exercises 1.5 


1. Let uw, wv, and w be sets. By the pairing axiom, the sets {u} and {y, wh 
exist. Using the pairing and union axioms, show that the set { u,v, w} exists. 

2. Let 4 bea set. Show that the pairing axiom implies that the set { A} exists. 

*3. Let A bea set. The pairing axiom implies that the set { A} exists. Using the 
regularity axiom, show that An {A} @ . Conclude that A g rie 

4. For sets A and #3, the set {A, B } exists by the pairing axiom. Let 
A € B. Using the regularity axiom, show that AN {A, B } @ , and 
thus B é A. 

5. Let 4, /8, and ( be sets. Suppose that 4 € #8 and B € C’. Using the 
regularity axiom, show that C’ ms A . [Hint: Consider the set { A, B.C ,} ] 

6. Let A and /} be sets. Using the subset and power set axioms, show that the set 


P(A) NB exists. 


7. Let 4 and J9 be sets. Using the subset axiom, show that the set A \ B exists. 


8. Show that no two of the sets @ , {ao} ‘ {@ p {a} } are equal to each other. 
*9. Let A bea set with no elements. Show that for all 2 , we have that 7 € 4 
if and only if a7 € © . Using the extensionality axiom, conclude that 4 — @ 


10. is yp(z, y) be the formula Vz(z Eyor 


xr) which asserts that 
y= {x} . As noted on page 25, for all @ the set {x} exists. So 


Vadlyp(a,y). Let A be 


{{a}:a € A} isaset. 


a set. Show that the collection 


2 
Basic Set-Building Axioms and Operations 


Nearly all mathematical fields apply concepts from set theory, and practically 
every mathematical statement can be phrased into one about sets. For these 
reasons, set theory is often viewed as a foundation for mathematics. In this 
chapter, we will begin to investigate some of the axioms of set theory that were 
introduced earlier. Moreover, we shall begin proving theorems about sets, and 
each of our proofs will be justified by the axioms. We will also show that the 
sets and operations discussed in Chapter 1 can be derived from these axioms. 


2.1 The First Six Axioms 


The first six axioms, taken together, will allow us to develop the operations on 
sets that were discussed in the previous chapter. Furthermore, they will allow us 
to define additional operations on sets that were not previously covered. 


2.1.1 The Extensionality Axiom 


Our first axiom is a very simple, but necessary, statement that proclaims that a 
set is uniquely determined by its elements. In other words, two sets are different 
if and only if one set contains an element that is not in the other set. 


Extensionality Axiom. Two sets are equal if and only if they have exactly the 
same elements. 


For sets A and B, the extensionality axiom yields two alternative strategies 
for proving that 4 — B: 


(a) Prove that AC B and BCA. 


(b) Prove for all x, that g € A ifandonlyifeze PB. 


We will refer to strategy (a) as the “double-subset” strategy, and strategy (b) 
will be described as the “iff” strategy, where iff abbreviates the phrase “if and 
only if.” One efficient way of executing strategy (b) is to derive a succession of 
equivalences starting with 7 € A and then ending with z € B. This is usually 
done by citing appropriate definitions and logic laws (see Sections 1.2 and 1.3). 
We will illustrate the difference between these two strategies by presenting two 
proofs of Theorem 2.1.1 in Section 2.1.3. 


The extensionality axiom does not imply that there exists a set. Our next five 
axioms do assert that certain sets exist. In addition, these axioms can be used to 
establish the existence of many of the sets that we typically take for granted in 
mathematics. 


2.1.2 The Empty Set Axiom 


Our next axiom explicitly implies that “a set exists.” 


Empty Set Axiom. There is a set with no elements. 


Is there more than one set without any elements? The Extensionality Axiom 
implies that there is only one such set (see Exercise 9 on page 27), and we shall 
denote it by the symbol @ . 


The next four axioms will allow us to build new sets from given sets. 


2.1.3 The Subset Axiom 


Subset Axiom. Let (a) be a formula. For every set A there exists a set § 
that consists of all the elements x € A such that y(a) holds. 


The subset axiom states that for any set A and for any formula y({a:), one can 
construct a set that consists of just the elements in 4 that satisfy the property 
v(x). In other words, the set S= {x €A:y(a)} exists. Observe that 
SGA: 


Let A and B be two sets. The subset axiom implies that the intersection and 
ANB={xE A: xe B}, 


difference of these two sets exist, because A\ B={xeEA:a ¢ B}, 


and thus, by the subset axiom, 4B and A \ B are sets. On the other hand, 
the subset axiom does not imply, in general, that the union A U P isa set. 


Our next theorem follows from the extensionality and subset axioms. We will 
give two proofs of this theorem. The first proof employs the double-subset 
strategy, and the second proof applies the iff strategy (see page 28). 


Theorem 2.1.1. Suppose A, B, and © are sets. Then AN( B\C)=(ANB)\C 


First Proof. Let A, 2B, and C _ be sets. We _ prove _ that 
AN(B\C)=(ANB)\C. 


(C). Let r€ AN(B\C). Hence, xe A andxé€ B\C. Thus, ze A, 
x é€B,and x ¢C. Because x € A and x € B, wehave x € ANB. Since 
x € C, we conclude that x € (AN B)\C. 


(D>). Let e€(ANB)\C. Thuss x€ ANB and «x ¢C. Because 
x € ANB, we have that x € A and x € B. Wealso have that « ¢ C, and 
so we now conclude that « € B\ C. Furthermore, we know that x € A. 


Hence, t € AN(B\C). 


Therefore, AN(B\ C)=(ANB)\C.0 


In our first proof of Theorem 2.1.1, the annotations (C ) and (> ) are added 
as a courtesy to the reader. The notation (C ) is used to make it clear to the 


reader that we are proving that the first set! is a subset of the second set. The 


notation (> ) indicates that we are proving that the second set is a subset of the 


first set. We shall now reprove Theorem 2.1.1 using the iff strategy. 


Second Proof. Suppose that 4, 8, and © are sets. Let 2 be arbitrary. Then 


2 € AN(B\C) iff z€¢ AAzE(B\C) by 
iff re AN(@TE BAEC) by 
iff (t€ AATEB)ALEC by 
iff re ANBA&TEC by 
iff x € (ANB)\C by 


Therefore, AN(B\ C)=(ANB)\C.0 


The argument used in Russell’s paradox will be applied in the proof of our 


next theorem, which states that there is no set of all sets. 


definition of N 
definition of \ 
logical associativity 
definition of N 


definition of \. 


Theorem 2.1.2. There is no set in which every set is a member. 


Proof. Suppose, for a contradiction, that there exists a set in which every set is a 


member. Let A denote this set. Thus Va(a € A); that is, 


the set 4 contains all 


sets as members. By the subset axiom there exists a set 8 such that 


B={reEeA:rE€s}. 


(2.1) 
Observe from the definition of B we have that 
BeB iff Be{teA:zrE€gx} by (2.1) (4) 
* 

iff BE Aand BEB by definition of {2 € A: x ¢ z}. 


Because 3 is a set and A contains all sets as members, we conclude that 
BeEA.Now that B € A holds, the above (*) implies that B € B if and 
only if B € B, which is clearly a contradiction. O 


Since every set is equal to itself, we see that each set belongs in the collection 
{x : a =a}. Theorem 2.1.2 implies that {2 : 2 = a} is not a set. Thus, given 
a formula y(a), we cannot immediately conclude that the collection 
{x :y(a)} is a set. Hence, we shall refer to any collection of the form 
{x : y(a)} as aclass. Thus, {a : xz =x} is aclass that is not a set. If a class 


is not a set, then it is a proper class, and hence, it is an “unbounded collection” 
(see Exercise 34). 


In the future, we will sometimes be required to prove that some classes are 
actually sets (see the proof of Theorem 2.1.7). When can we prove that a given 
class {x : y(a)} is also a set? The following theorem addresses this question. 


Theorem 2.1.3. Let (a) be a formula. Suppose that there is a set 4 such that 
for all x , if y(a), then x € A. Then there is a unique set D such that for all 
x €D if and only if p(x). 


Zz, 


In other words, the class {2 : y(a)} is, in fact, equal to the set D. 


Proof. Let A bea set such that (A) for all w, if y(a), then 2 € A. By the 
subset axiom, let D be the set {a € A: y(a)}. Clearly, (4) implies that 


«x€D if and only if v(x), for all 2. The set D is unique by the 
extensionality axiom. L 


Remark 2.1.4. The regularity axiom implies that no set can be a member of 
itself (see Exercise 3 on page 26). Thus, in the proof of Theorem 2.1.2, the 
axioms of set theory imply that the collection 8, defined in (2.1), is equalto A. 


Remark 2.1.5. When applying the subset axiom, we can use a formula that is 
expressed in English, using appropriate symbols (e.g., @ , ™). Such English 
formulas can actually be written as formulas in the language of set theory. 


2.1.4 The Pairing Axiom 


The pairing axiom states that whenever we have two sets w and wv, there is a set 
P whose only members are wu and wv. We can use the extensionality axiom to 
show that this set P is unique. We shall denote the set P by {u,v}. 


Pairing Axiom. For every w and wv there is a set that consists of just uw and wv. 


Given any sets uw and w, the pairing and extensionality axioms justify the 
following definition. 


Definition 2.1.6. For sets @4 and wv, the pair set {u, uv} is the unique set whose only 


members are 7/ and 2). 


Thus, if a € {u,v}, then @ =u or a= v. The pairing axiom implies that 
for any set u, we also have the set {w,u}, which, of course, is equal to the set 
{u} by the extensionality axiom. So, given any set w , we can construct the set 


{u}. Since the set {uw} has only one element, such a set is referred to as a 
singleton. 


Suppose that we have three sets wu, uv, and w. The pairing axiom yields the 
sets {u,v}, {{u,v},w} , and {{u,v},{w}} . Can we conclude that there is a 
set containing only uw, uv, and w? We will be able to do this once we 
understand the union axiom. 


2.1.5 The Union Axiom 


Let F bea set. The union axiom allows us to construct a set that consists of all 
the elements that belong to at least one of the sets in F .* 


Union Axiom. For every set ¥ there is a set (/ that consists of all the elements 
that belong to at least one set in F. 


The union axiom states that, for any given set ¥, there exists a set [) whose 
members are precisely the elements of members in *. The set {/ given by the 
union axiom is denoted by |) ¥ . Thus, |) ¥ is the set of elements a such that 


—{7:2x2€C for some C ‘ 
a € © forsome ( ¢€ F;; that is, UF {2 ne nie . 7) 
We shall say that the set U F is the union of 7. So for any set x we have that 


rE JF if and only if (AC € F)(a € C). 


Example i Let PF =4) e.botre: Pte ear. Then 
LP = te, Orde, Pb. 


The union axiom, together with the pairing axiom, allows us to construct the 
union of any two sets 4 and 8. Let F = {A,B} . Then 


fA, B} = {x : x belongs to some member of {A, B}} 
{ex:xE€Aorze B} 
= AUB. 


Similarly, we have that 
U{A, B,C,D} = AUBUCUD and {A} = A. In addition, we have 
Ljg=2. 


Example 2. Consider the set F = {{a,b,c}, {e, f},{e,c,d}}. Then 
Lhe = fa, bc... f} =aUbUcUdUeU Ff. 


Suppose that we have three sets w, vw, and w. We can now use the pairing 
and union axioms to construct a set that consists of only wu, v, and w. We have 
the sets {u,v} and {w} by the pairing axiom. By the union axiom we have 
the set 


{u,v} U{w} = {u,v, w}, 


which consists of the three elements uw, v, and w. Similarly, one can assemble 
the set {a),2,%3,24} consisting of the four elements ©), 2,3, %4, and so 
on. 


We saw that the subset axiom allows us to construct the intersection of two 
sets. Our next theorem will allow us to take the intersection of all the sets that 
belong to a given nonempty set. 


Theorem 2.1.7. Let # be a nonempty set. There is a unique set J such that for 
all x, 


x El if and only if x belongs to every member of F. 
(2.2) 


Proof. Let # be a nonempty set. So let A € ¥. For each 2, if x belongs to 
every member of F, then clearly « € A. Since A isa set, Theorem 2.1.3 
implies that there is a unique set J satisfying (2.2). L 


Definition 2.1.8. Let J be a nonempty set. We denote the unique set J given in the 


above Theorem 2.1.7 by () F , which is said to be the intersection of F - 


Thus, if F is a nonempty set, then x€()F if and only if 
(VCE F)(x EC). 


Example 3. Suppose F = {{a,b,c,e}, {e, f}, {e,c,d}} . Then ()F = {e}. 


We see that (){A,B,C,D}=ANBNCND and [()\{A}=A. 
Furthermore, if @ € F,then()F = ©. 


Why does Definition 2.1.8 require that * # © ? Observe that the statement “ 
x belongs to every member of @ ” is vacuously true. To see this, suppose that 
this statement is false. Then 2 does not belong to some member of © , but the 
set @ has no elements! Therefore, the statement is true for all sets x . 


set. Thus, 


feb) 


Now, suppose ()2 is 


@ = {x: az belongs to every member of @}. 


Because the statement “ belongs to every member of © ” is true for every x , 
we conclude that (}@ must contain every set, and this contradicts 


Theorem 2.1.2. Therefore, the notation (| @ is undefined. 


Example 4. Let F = {{c,e, f}, {e, f}, {e, f,c, d}} . Then 


ANF =Mesh=ens. 
“UNF=Ulet=eur. 

. PUFF = hee 4. Ff} =cnidnen fs. 
. UUF =oed, f} =cudveu f. 


RWN RP 


The next example clearly states what it means for an element to be in, and not 
in, a union and intersection. 


Example 5. Suppose that ¥ is set. Then the following statements are true: 


(1) we€ UF means that 7 € © forsome C € F. 
(2) « € JF means that « ¢ C forevery CE F. 
(3) « €()F means that x € C forevery CE F. 
(4) « €(\F means that « ¢ C forsome C € F. 


We now demonstrate how one works with our generalized notions of union 
and intersection in the following theorem and proof. 


Theorem 2.1.9. Suppose that F CG. Then (JF CUG, and if F is 
nonempty, then ()G Cf)F. 


Proof. Let F C G.To prove that [JF CUG, let x € JF. Thus, x € C for 
some (© € fF. Since #- CG, we have that CEG. So x EC for some 
C €G.Hence, « € JG. Now, suppose that F is nonempty. Because F C G 
, we see that G is also nonempty. Let x € () GY . Thus, x € C forevery CEG 
. Since F C G, we conclude that 2 € C for every C € F as well. Therefore, 
ref)F.u 


2.1.6 The Power Set Axiom 


Recall that X C A means that X is a subset of A ; that is, every element in 
X isalso an element in 4 . Given any set A, the power set axiom ensures that 
one can form a set that consists of every subset of 4. 


Power Set Axiom. For every set 4 there is a set P that consists of all the sets 
that are subsets of 4. 


When A isa Set, the power set axiom states that there is a set P such that if 
X CA, then X € P. In addition, this axiom implies that if X € P, then 
XCA. 


eh 


Definition 2.1.10. Let 4 be a set. The power set of A , denoted by P(A), is the set 


whose elements are all of the subsets of  ; that is, ‘P? (A) {X , G* A} : 


Thus, as noted above, X € P(A) if and only if X C A. Recall that @ C A 
and A C A (see page 2). Therefore, @ € P(A) and A € P(A) for every set 
A .. The power set of a set A has more elements than A . For example, 


(9) = {9}. 

({a}) = {©, {a}}. 

({2,a}) = {9, {OG}, {a}, {P,a}}. 

» Pid 2 ot Hf etl eh teh tise lett ehiiaaore 


The power set operation distributes over the intersection of two sets (for a 
generalization, see Exercise 14 on page 40). 


Theorem 2.1.11. Let 4 and #8 be sets. Then P(AN B) = P(A)NP(B). 


Proof. Let 4 and B be sets. We shall prove that P(A NM B) = P(A) N P(B) 


(CC). Let X €E P(ANB). Hence, X CANB. Thus, X CA and 
X CB (see Exercise 4). So X € P(A) and X € P(B). Therefore, 
X €P(A)NP(B). 


(>). Let X € P(A)NP(B). Thus, X € P(A) and X € P(B). Hence, 
X CA and X CB. Therefore, X C AN B (see Exercise 4), and we now 
conclude that X € P(AN B). O 


Theorem 2.1.11 motivates the following simple question: Can one prove the 
equality P(A U B) = P(A) UP(B) for any two sets 4 and #8 ? The answer 
is no. Let A = {1,2} and B = {2,3}. Clearly, the set X¥ = {1,3} is subset 
of AUB, and thus, ¥ € P(AU 8B). Because X is not a subset of 4 and is 
also not a subset of 8, we see that X ¢ P(A) UP(B). So X € P(AUB) 
and X ¢ P(A) UP(B). Therefore, P(AU B) # P(A) UP(B). 


Exercises 2.1 


Prove the following theorems, where 4 , 3, C’,and J) are sets: 


1. Theorem. If A C B,thn AC AUB ad ANBCA. 
*2. Theorem. If 4 C B and BCC ,thn ACC. 

“3. Theorem. If 8 C C’, then 4 \ Co A \ B. 

“4. Theorem. C’ C A and C C B ifandonlyifC C ANB. 
5. Theorem. There exists an 2 such that 2 g A. 


6. Theorem. 490 B= BN A- 

7. Theorem. 4U) B = BUA. 

8. Theorem. aes C)= > (AN B)U (ANC). 
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. Theorem. A U (B (AU B)N(AUC). 
10. Theorem. 4 U (B C) ='( AU B) Cae. 
11. Theorem. 4 (B C) ={ AN B) c.. 


= 


2. Theorem. C\ (A M B) = (C \ A) U (C'\ B). 

3. Theorem. C” \ (A U B) = (C \ A) NM (C'\ By. 

14. Theorem. (A ‘ B) NM (Cc \ B) = (A M C) a 

15. Theorem. AN ( Bn C) — (A NB ) aL% and 
AU (B UCc}= (A U B) We". 

16. Theorem. (A U B) \ (A M B) = (A \ B) U (B i} A) : 

17. Theorem. If A \ B = C ,then A \ Ce 5. 

18. Theorem. If 4 C B ad BOC =@,thn ACB a. cf. 

19. Theorem. If A \ B C C and A Z C,thn ANB 7 @. 

20. Theorem. A C #3 if and only if P(A) Cc P(B). 

21. Theorem. P(A) U P(B) = P(A U B) 

22. Theorem. If P(A U B) C P(A) UP(B), tr AC Bo BCA. 
23. Theorem. Let 4 be aset. Then { A} c P(A). 

24. Theorem. Let 4 be aset. Then P(A \ B) # P(A) \ P(B). 

25. Theorem. Let F be aset andlet (’ € fF .Then C’ C U F. 

26. Theorem. Let F beaset andlet ©’ €  . Then () Fe. 

27. Theorem. Let F and A be sets. If A C C for some C’ € Ff, then we have 
that AC UF. 

28. Theorem. Let # @ and A besets. If A C C’ forall C € F, then we 
have that A C () F. 

29. Theorem. Let F and A be sets. Suppose ©’ C A for every (’ € F . Then 
UFCA. 

30. Theorem. Let 4 be aset. Then J P(A) =A. 

31. Theorem. Let 4 beaset. Then A C P (U A). 

* 32. Theorem. Let (’ © F. Then P(C) rai (P (U aes 

33. Theorem. The collection {x : dy (x — y)} is not a set. 


= 


“34. Theorem. Let p(x) be a formula. Then {x : y(a)} is a proper class if and 
only if for every set A there isan 2 such that p(x) and x g A. 


35. Theorem. Define the symmetric difference of any sets .¥ and Y to be the set 


XAY= (X \ Y) U (Y \ X ). Then for sets A, B.C wehave 


@ AN(BAC)=(ANB)A(ANC). 
) AA(BAC)=(AAB)AC. 


Exercise Notes: Exercises 8—9 are referred to as the distributive laws for sets. 
Exercises 10—11 are called the associative laws for sets. 


2.2 Operations on Sets 


Sets can be combined in a number of ways to construct another set. We will now 
see how the first six axioms of Zermelo—Fraenkel set theory can be used to prove 
a number of theorems concerning several important methods, not identified in 
the previous section, for constructing a new set from given sets. 


2.2.1 De Morgan’s Laws for Sets 


Lemma 2.2.1. Let 4 and F be sets. Then there is a unique set 1 such that for 
al y, 


Y €D iff Y=A\C for some C € F. 
(2.3) 


We shall let {A \ C : C € F} denote the set D. 


Proof. Let 4 and F be sets. If Y = A\ C forsome C € F, then Y CA 
and so Y € P(A). Since P(A) is a set, Theorem 2.1.3 implies that there exists 
a unique set — satisfying (2.3). U 


The “double-subset” proof strategy will be used to prove our next theorem. 
Before reading this proof, one should review items (1)-(4) of Example 5 on 
page 34 and note that x Ef(}{A\C:CeEF} iff ce A\C for every 
Cef. 


Theorem 2.2.2 (De Morgan’s Laws). If A is aset and F is nonempty, then 


(1) A\UF=(YA\C:CeEeF}. (2) 


A\A\F=U{A\C:CeF}. 
Proof. We prove that A\|JF =(\{A\C:C¢€ Fy} and leave (2) as an 


exercise. 


(Cc). Let rx€ A\UF. We prove that rE f\{A\C:CEF} as 
follows: 


zE€A\|JF= reaA and « ¢ |_)F by definition of \ 
=> «€ Aand x é€C for every C € F by definition of U 
=> xr€A\C for every CE F by definition of \ 
=> xé€(}{A\C:CeF} by definition of (). 


Therefore, A\IJF CP{A\C: CEFF}. 


(>). Let x Ef){A\C:C € F}. We prove that x € A\ UF as follows: 


xe( {A\C:CEF} 


=> x€A\C forevery CE F by definition of a 
=> x€Aand x ¢ C for every C € F by definition of \ 

> r€Aandz¢ LF by definition of U 
> c€A\(JF by definition of \. 


Therefore, (\{A\C: C € F} C A\ UF. The proof of (1) is complete. 


2.2.2 Distributive Laws for Sets 


Lemma 2.2.3. For all sets 4 and #, there exist unique sets — and € such that 


for all ae 
Y €D iff Y = AUC for some C € fF, (2.4) 
Y €€ iff Y= ANC for some C € F. (2.5) 


We let {AUC:C € F} denote the set D and {ANC:C EF} denote 
the set €. 


Proof. Let A and F be sets. If Y—=AUC for a CeF, then 
Y CAU(UF). Thus, Y € P(AU(UF)). As P(AU(UF)) is a set, 
Theorem 2.1.3 asserts that there exists a unique set — satisfying (2.4). In a 
similar manner, one can show that there is a unique set £€ satisfying (2.5) (see 
Exercise 5). LJ 


Theorem 2.2.4 (Distributive Laws). If 4 and F are sets, then 


(1) AU((I)F) =(YAUC: Ce F} for FFD, (2) 
AN(UF)=U{ANC:CeEeF}. 


Proof. We prove (1) using the “iff” strategy and leave (2) as an exercise. Let x 
be given. We prove that x€ AU((\F) iff cEef\{AUC:CEF} as 
follows: 


xeEAU (M F) iff € Aorxve [)F by def. of U 
iff e€ Aor x€C for every CE F by def. of () 
iff « € AUC for every CE F by def. of U 


iff x € ( {A UC:CeF} by Lemma 2.2.3. 


The above proof implicitly applied the quantifier distribution law (QDL) 
1.3.6(5). We demonstrate this in the following more “symbolic” proof: 


xE AU (=) iff te AVae wey by def. of U 
iff re AV(VCEF)xEC by def. of () 
iff (WCE F)(xE AVEC) by QDL 1.3.6(5) 
iff (VC € F)(a € AUC) by def. of U 
iff ce( }{AUC:CEF} _ by Lemma 2.2.3. 


We now prove a lemma that will justify some of the exercises of this section. 


Lemma 2.2.5. If ¥ is a set, then there exists a unique set 7 such that for all Y 
Y €D iff Y = P(C) for some C € F. 
(2.6) 


The set D is denoted by {P(C):C € F}. 
Proof. Let # be any set. If Y =P(C) for some Ce fF, then 


Y € P(P(UF)) by Exercise 32 on page 37. As P(P (J F)) is a set, there 
exists a unique set — satisfying (2.6) by Theorem 2.1.3. LJ 


Exercises 2.2 


Prove the following theorems: 


1. Theorem. Let A be aset and FFD . Then A\ (1) F Ut A\C ‘C' EF}. 
2. Theorem. Let A and F be sets. Then 


AU(UF) =U{AUC: Ce F}. 

3. Theorem. Let A and F be sets. Then 
AN(UF) =U{ANC: Ce F}. 

4. Theorem. Let f @. Then AN ((\F) = (\{A AG:ceE a for 
any set A . 

*5, Theorem. Let 4 and F be sets. Then there exists a unique set € such that for all 
Y we have that Y € € ifandonlyif Y — AMC forsome (’ € F. 

6. Theorem. Let and G be sets. Then there is a unique set J) such that for all Y, 
we have that Y € J) if and only if Y — AP forsome 4 € F and some 
BEG. 

7. Theorem. (LJ F)N (UG) =U{ANB: AEF and BEG} for 
any sets F and G. 

8. Theorem. Let F and G be sets. Then there is a unique set J) such that for all Y, 
we have that Y € J) if and only if Y AUB for some A € F and some 
Beg. 

9. Theorem. ((]F)U(()G) =(\{AUB:AEF and BEG} 
when F and G are nonempty. 

10. Theorem. (U F)NU G) = LJ{A M B:ACF and Beg} for any sets 
F and G. 

11. Theorem. Let F and G be sets. Then UF U G) = (U Fy) U (U G). 

sagt 2 Theorem. If F and G are nonempty, then 
MFUS) = (NFAY). 

13. Theorem. Let F and G be sets. Then U(F M G) 2 (U F) M (U G). 
“14, Theorem. Let F be a nonempty set. Then 
P()VF)=({P(C) : Ce F}. 

15. Theorem. Let F be aset. Then |){P(C) : C € F} C P(\U F). 

16. Theorem. Let F be a set. If there is an A € F such that ©’ C A for all 


Ce F.ten P(YUF) CU{P(C): Ce F}. 


3 
Relations and Functions 


In mathematics, relations and functions are usually defined in terms of ordered 
pairs. The concepts of an ordered pair, a relation, and a function are essential for 
mathematics. Is there a way to define these three notions in terms of sets? We 
will show that this, in fact, can be done. 


3.1 Ordered Pairs in Set Theory 


Before discussing relations or functions, we must first define an ordered pair. 
The set {2,3} is an unordered pair because {2,3} = {3,2}. We need to 
define an object of the form (2,3) that captures the notion of the first 
component 2 and the second component 3. Furthermore, (2,3) must be a set, 
and we must have that (2,3) 4 (3,2). 


Can an ordered pair be defined to be a set? If it can, then such a definition 
must satisfy the key property 


(a,b) = (c,d) if and only if a= cand b= d. 
(3.1) 


The equivalence (3.1) ensures that an ordered pair has a unique first component 
and a unique second component. In 1921, Kazimierz Kuratowski answered the 
above question with the following definition. 


Definition 3.1.1. The ordered pair (x. y) , with first component 2 and second 


component ¥ , is the set defined by (2, y) {{x} ’ 1c; y} }. 


pO 


Clearly, for sets x and ¥Y, the above (x,y) is a set. To prove that 
Kuratowski’s definition satisfies (3.1), we first establish a lemma. 


Lemma 3.1.2. Let U,v,2,Y be given and suppose that (a,y) = (u,v) . If 
“=Yoru=v,then «=u andy=v. 


Proof. Assume (x,y) = (u,v). So (a) {{x}, {z,y}} = {{u}, {u, v}}. If 
z=y, then {{x}, {2,y}} = {{x}, {x,2}} = {{z}, {x}} = {{r}} . Thus, 
by (A), {{a}} = {{u}, {u, v}}. 


Hence, {x} = {u}= {u,v}. So x=u=v. Therefore, as ©=Y, we 
conclude that a =u and y=v. If w=v, a similar argument shows that 
=u ady=v.U 


Theorem 3.1.3. Let U,V, X,Y be given. Then 
(x,y) = (u,v) tf and only ifx =u andy=v. 


Proof. Assume that (x,y) = (u,v). If either © =¥Y or u=v, then 
Lemma 3.1.2 implies that « = uw and Y = U. Suppose that « A y andu fv. 
Since 


{{a}, {a, y}} = {{u}, {u, v}}, 
(3.2) 


we conclude that either {a} = {u} or {a} = {u,v}. Since u # v, we must 
have {a} = {u}, and so x= uw. Thus, {{x}, {z,y}} = {{x}, {a,v}} by 
Equation (3.2). Since « # y, we infer that {w, y} = {a,v}, and since x # y, 
we deduce that ¥ = v. 


Conversely, if 2 = u and y = v, then clearly (a, y) = (u,v) .O 


Let A and Bf be sets. Let x € A and y € B. Note that the sets {a} and 
{x,y} are both elements in P(AUB). Therefore, 
{{x},{z,y}} C P(AUB), and hence {{x}, {2,y}} € P(P(AUB)). So 
(z,y) € P(P(AUB)) foralla € A and ye B. 


Lemma 3.1.4. Let 4 and # be sets. There exists a unique set D such that for 
al y, 
Y €D iff Y = (2,y) for somex€ A andy€ B. 
(3.3) 


We shall let {(a, y) : 2 € A and y € B} denote the set D. 


Proof. Let A and BP be sets. If Y = (w,y) for some a € A and some 
y€ B, then Y € P(P(AUB)) (see above). Since P(P(AU B)) is a set, 
Theorem 2.1.3 now implies that there exists a unique set —D satisfying (3.3). LJ 


Definition 3.1.5. Given sets A and J}, the Cartesian product A x J is defined to 
be the set 


Ax B= {(z,y):x€Aand ye B}. 


In other words, A x J is the unique set, given by Lemma 3.1.4, that consists of the 
ordered pairs ‘z. y) such that € A andye B. 


Example 1. Let A={2,3} and let B= {a,b,c,3}. Then 


Ax =4t2. a), (2,0), Aer, (2,5), ya), 3,0), X82), 33ers 


Exercises 3.1 


1. Define (a, bh. c) ((a, b) . c) for any sets @, ), @ . Prove that this yields 


an ordered triple; that is, prove that if (x, Y, z) (a, b. ¢ 2) , then @ a 
,Y bh ,and 2 Cc: 

2. Prove that (AU B) x C= (Ax C)U(BxC). 

3. Prove that (A \ B) a Bi (A x C) \ (B x C). 

4. Prove that (JF)xC U{AxC:AeEF}. 


3.2 Relations 


We now discuss the mathematical theory of relations. We are already familiar 
with the three relations gq — (equality), @< (less than), and X¥ CY 
(subset). Many of the fundamental concepts of mathematics can be described in 
terms of relations. In this section, we shall identify a relation as a set of ordered 
pairs and explore various properties that a relation may possess. 


Definition 3.2.1. A relation /? is a set of ordered pairs. 


Thus, a set is a relation if each of its elements is an ordered pair. In other 
words, a set is a relation iff for each 7 € A, there is an a and # such that 
x = (a,b) . Definition 3.2.1 does not put any restrictions on the ordered pairs 


that can appear in a relation. 


pe 


Definition 3.2.2. Let A and #3 be sets. A relation F? from A to #3 isa subset of 
Ax B;thai, RCAXB. 


Let A = {2,3} and let B= t4,6;6;3} . Then 


R= tie); (200) GOs (50) 4S 


is an example of a relation from 4 to #. One can also show that the empty set 
is also a relation (see Exercise 1). 


Given a formula y(a,y) and sets 4 and 3, the subset axiom implies that 


one can construct a_ relation R by forming the set 
R=iis.g)€ Ax Bole; 4)}- 


Example 1. Let 4 and # be sets. Consider the following relations: 
(1) R={(z,ys\E€ Ax B:z2=y}. 


{ 
(2) R={(xz,y)E€ Ax B:xEey}. 
3) B={ieyy eA x Bie Cy}: 


Definition 3.2.3. A relation ? on A isasubsetof 4 x A ;thatis, R a AxA. 


Example 2. Let A = {2,3}. Then the set R = {(2,2), (2,3), (3,3)} is a 
relation on A. 


Example 3. Let N = {0,1,2,3,...} and define the relation R on the set N 


by R={(rz,y)ENxN: (SREN)\(xa=yk)}. So (6,3)ER and 


For any set A , the relation Ig ={(z,y) €EAxXA:z=y} 
is called the identity relation on A . 


We know that every relation is just a set of ordered pairs. Can every relation 
also be viewed as a relation on a set? That is, given any relation R , can we find 
a set A and prove that RC Ax A? The following lemma answers this 
question. 


Lemma 3.2.4. Let R be arelation andlet A=|J\J)R.Then RC AXA. 


Proof. Let R be a relation and let A= |). Clearly, 4 is a set by the 
union axiom. We prove that RCAxA. Let (a,b)ER. So 
(a,b) = {{a},{a,b}}. Since {{a}, {a,b}} € R, we see that {a}e UR 
and {a,b} €UJR. Now, since {a} € JR, we conclude that ae DUR. 
Similarly, we infer that bE |J|JR. Hence, as A =|J|JR, we have that 
a€A andbeA.So (a b)e AX A.U 


Let R be a relation and consider the class {x : dy((a,y) € R)}. Is this 
class a set? If Sy((x,y) € R), Lemma 3.2.4 implies that « € JU FR. Since 
UU R is a set, Theorem 2.1.3 implies that {a : dy((z,y) € R)} is aset. A 
similar argument shows that {y : Sa (a,y) € R} is also a set. We can now 
define the sets dom(/?) and ran(/2) for any relation R . 


Definition 3.2.5. Suppose /? is a relation. 


1. The domain of R., denoted by dom( R ) ; is 


dom(R) = {a : dy({x,y) € R)}. 

2: The range of FR, denoted by ran(R) , is the _ set 
ran(R) = {y: de((z,y) € R)} . 

3. The field of J? is the set fld(R) = dom(R) Uran(k). 


Consider the relation 
R= {(a,b) € Ax B:a evenly divides b and a < b} where 
A= {0,1,2,3,4,5} and B= {4,6,7}. Let us evaluate the sets dom(R) 
and ran(R). We first observe that 


R = {(1,4), (1,6), (1,7), (2,4), (2,6), (3, 6)}. 


Hence, dom(R) = {1,2,3} and ran(R) = {4,6,7} . 


3.2.1 Operations on Relations 


Let R be a relation. In our next definition, we define ,~! to be the class 
R71 = {(v,u) : (u,v) € R}. 


We now show that R~! is a set. Since ran(/) and dom() are sets, we 
conclude from Lemma 3.1.4 that ran(R) x dom(R) is a set. Because 
ft += fe zpla)} where (x) is the formula 
Jude(a2 = (v,u) A (u,v) € R), it follows that for all a, if (a), then 
x € ran(R) x dom(R). So, by Theorem 2.1.3, R~! is a set. 


Our next definition is usually applied to functions, but it is also applicable to 
relations. We will see in Section 3.3 that functions are just relations that satisfy a 
special property. 


Definition 3.2.6. Let J? and be relations. 


R7! = {(v,u) : (u,v) € R}. 


(1) The inverse of /? is the relation 


Thus, (y, 2) € Ro? iff (x, yvER. 


(2) For each set A, the restriction of #2 to A is the relation 


R| A= {(u,v): (u,v) € RAu€e A}. 


Consequently, con y) ER| A iff te y) ERAZEA. 


(3) For any set A, the image of A under Qf is the set given by 
R[A] = {v : (au € A)((u,v) € R)}. 


Therefore, y € R[A] iff (Ar € A)({z,y) € R). 


(4) For a set #8, the inverse image of # under Rf is the set 
R7"[B] = {u: (au € B)((u,v) € R)}. 


Hence, 2 € R~1(B] iff (Ay € B)((z,y) € R). 


(5) The composition of R and $§ is the _ relation 
RoS = {(u,v) : dt((u,t) € SA (t,v) € R)}. 


Consequently, (a#,y) € RoS iff St((a,t) E SA (t,y) € R). 


As noted earlier, 7-1 is a set for every relation 2 . Similar reasoning shows 
that the above R{A, RA], R-'[B], and RoS are also sets (see 
Exercise 7). 


Example 4. Consider the relation © on the set P of all people defined by 


C= {(z,y) € Px P: 2 isa child of y}. 


Then C-1= {{y,z) € Px P: y is a parent of x} and 
CoC = {{z,z) € Px P: zis a grandchild of z}. 


If A is the set of all people bom iin _ Australia, then 
C~'|A] = {a € P: x has a parent who was born in Australia}. 


Now consider the relation ' ¢ on P given by 
L={(u,v)€ Px P:ulovesv}. To evaluate [,.o@-1, we obtain 
(a,b)€ LoC iff (a,t)}EC 1A(t,b)EL  forsomet 

iff (ta)ECAt,b) EL for some ft 


iff a has a child who loves b. 


Theorem 3.2.7. Let R , S , and 7’ be relations. Then 


(1) dom(R~*) = ran(R). 

(2) ran(R~') = dom(R). 

(3). {iec4y“* = Fe. 

(4) (Ro 9)-? =S"*0 FR“. 
(5) Ro(SoT)=(RoS) oT. 


Proof. The proofs of items (1)-(3) are reserved for the exercises. We will prove 
items (4) and (5) by appealing to propositional logic laws. 


(4) We shall prove that (Ro S)~!' = S~!o R~!.Let x and y be given. Then 


(x,y) €(RoS)! iff (y,z)eERoS Def. 3.2.6(1) 
iff (y,thESA(t,x)ER for some t, by Def. 3.2.6(5) 
iff (¢,27) ERAy,thES commutative logic law 
iff (a,t)e R-'A(t,y) ES Def. 3.2.6(1) 
iff (c,y)€S-'oR™” Def. 3.2.6(5). 


Therefore, (Ro S)-'=S-'toR™. 


(5) To prove that Ro(SoT) = (RoS)oT*, let « and y be given. Then 
(x,y) € Ro(So al iff (z7,u)E€SoTA (uy) ER for some u 
ff ({x,w) € TA (w,u) € S)A (u,y) € R for some w 
iff (x, w) ETA((w, u) ES)ACu, y) ER) associative law 
iff (a,w) ET A(w,y)€ RoS Def. 3.2.6(5) 
(: 


iff (z,y) € (RoS)oT Def. 3.2.6(5). 


Therefore, Ro(SoT)=(RoS)oT.U 


Let R be a relation and let G be a set (of sets). So F||_)G| can be viewed as 
the image of a union. In Exercise 8, one is asked to prove that the collection of 
images {R[(C]:C EG} is a set. Thus, J{ RIC]: C EG} is the union of 
images, where y € LJ{R[C]: C € G} iff y € RIC] for some C € G. In our 
next theorem, items (1) and (2) show that “the image of a union is the union of 
the images,” whereas items (3) and (4) proclaim that “the image of an 
intersection is a subset of the intersection of the images.” In the proof of this 
theorem, we will be using the quantifier distribution laws (QDL), namely, 
1.3.5(3) and 1.3.5(4) on page 19. 


Theorem 3.2.8. Let R be a relation. Suppose that A and # are sets. In 
addition, suppose that G is a set (of sets). Then 


(1) R[AU B] = R[AJU RIB). 


(3) R[AN BI C R[A]N RIB). 
(4) RING] CA{RIC]: C € G} when Gg Zo. 
(5) R[A]\ R[B] C RIA \ B). 


Proof. We prove only items (1) and (3). The proofs of (2), (4), and (5) will be 
left to the reader (see Exercises 8, 19, 20, and 21). Let R be a relation and let 
A and Bf be sets. 


(1) We shall prove that R{ AU B| = R{A| U RiP]. Let y be arbitrary. Then 


y€ RI|AUB| iff Ge € AU B)((a,y) € R) Def. 3.2.6 
iff (Ar € A)((x,y) € R)V (Ar € B)((z,y) € R) QDL 1.3.5(3) 
iff ye RIA] Vy € R[B| Def. 3.2.6 
iff y€ R[A|UR[B] definition of U. 


Therefore, R[A U B] = R[A] U RIB]. 


(3) To prove that R|AN B] C R|A]N RB], let y € R|AN B]. Observe that 


y € RIAN B iff (Aw € AN B)((z, y) € R) Def. 3.2.6 
=> (dar € A)((z,y) € R)A (Aaa € B)((z,y) € R) QDL 1.3.5(4) 
iff ye R[A]Ay € R[B| Def. 3.2.6 
iff y€ R[A|N RB} definition of N. 


Hence, y € R[A] NM R[B]. Therefore, R[AN B] C R[A]N R[B).0 


Is there a particular property so that if a relation 2 satisfies this property, 
then equality will hold for items (4) and (5) of Theorem 3.2.8? Our next 
corollary shows that these two items can be expressed as equalities whenever R 


happens to be a single-rooted relation. 


Definition 3.2.9. A relation /? is single-rooted if for each y € ran(R) there is 


exactly one 2 such that ‘z, y) ER. 


Thus, if R is a single-rooted relation, then whenever (x,y) € R and 
(z,y) € R, wecan conclude that x = z. 


Corollary 3.2.10. Let 2 be a single-rooted relation. Suppose that 4 and #9 are 
sets and that G is anonempty set. Then 


(1) R[A] \ R[B] = R[A\ B}. 
(2) RING] =N{RIC]:C eg}. 


Proof. Let 2 bea single-rooted relation. 


To prove (1), let 4 and f# be sets. By Theorem 3.2.8(5), we need to prove 
only that R[A \ B] C RIA] \ R[B]. Let y € R[A \ B]. Thus, (a#,y) € R for 
some x€ A\B.Soxve A andxa€B. Since x € A and (x,y) € R, we 
have y € R|A|. We now show that y ¢ R{B]. Assume, to the contrary, that 
y € R[B|. Therefore, (b,y) € R for some b € B. Hence, (b,y) € R and 
(x,y) € R. Because RF is single-rooted, we must have that z7 — and so 
x € B, which is a contradiction. Thus, y ¢ R{B]. Therefore, y € R{A] and 
y € R[B|. We conclude that y € R{A] \ R[B}. 


To prove (2), let G be a nonempty set. By Theorem 3.2.8(4), we just need to 
prove that {RIC :CEG}C RIN G|. Let ye OL RIC] :C EG}. Thus, 
we have that y € R|C| for every C € G . In other words, 


for every C € G, there is an x € C such that (z,y) € R. 


(3.4) 
Claim. There is a q such that d €{)G and (d,y) E R. 


Proof of Claim. As G is nonempty, let D € G. By (3.4) there isa dE PD so 
that (d,y) € R. We now show that this particular q is an element of every 
CeéegG.Let C €G. By (3.4) there exists an x € C so that (x,y) € R . Thus, 
(x,y) € R and (d,y) € R. Because R is single-rooted, we conclude that 
a — d . Hence, d € C and therefore, d € () G and (d,y) € R.(Claim) UO 


The above claim implies that y € R|{)G| and this concludes the proof of (2). 
The proof of the corollary is now complete. LI 


3.2.2 Reflexive, Symmetric, and Transitive Relations 


The symbols # and ~ shall be used to denote relations. Let R and ~ be 
relations on a set 4. If @eé A and hE A, we will write g@ Rp) to mean that 
(a,b) € R.. When we use the notation @ Rh , we shall say that “a is related to 
}.” Similarly, we write q@ ~ } to mean that (a,b) € ~ and say that a is 
related to ). We also write a ~% } to assert that “a is not related to } .” 


Relations that have properties similar to those of equality appear frequently in 
mathematics. We now recognize three fundamental properties of equality. For 
quantities x, Y, and z, we have the following: 


1. x =@2 (reflexive). 
2. If «= y, then y = & (symmetric). 
3. If © = yY and Y = Z, then x = z (transitive). 


In this section, we will investigate relations that share some, or all, of the above 
three properties. In our next definition, we shall identify what it means for a 
relation to be reflexive, symmetric, and transitive. Relations that possess one or 


more of these properties occur naturally in many areas of mathematics, including 
set theory. 


Definition 3.2.11. Let ~ be arelationon A . Then 


(1) ~ is reflexive if (Va E A)(a wy £): thatis, @ ~ & forevery 7 € A. 


(Va € A)\(Vy E A\(a~r~yoyr~a). 


(2) ~ is symmetric when 


In other words, if & ~~ Y, then Y ~ © whenever wy € A andy € A. 


(3) ~ is transitive if 
(V2 € A)(Vy € A)(Wz € A)[(a~yAyrwz)aar~zel. 


~ 


That is, if © ~ y and y ~ Zz, then x ~ z whenever Z,¥Y,2 areelementsin A. 


For a relation on A , the above properties can be expressed as follows: 


1. The relation is reflexive if every element in the set A is related to itself. 

2. The relation is symmetric if whenever x is related to y, then y is 
related to x. 

3. The relation is transitive if whenever x is related to Y and ¥Y is related 
to z, then x is also related to z. 


3.2.3 Equivalence Relations and Partitions 


Because the equality relation is so useful and essential, mathematicians have 
generalized this concept. A relation shall be called an equivalence relation if it 
satisfies the three key properties that are normally associated with equality. 


Definition 3.2.12. A relation on a set A is an equivalence relation on 4 (or just an 
equivalence relation when A is clear from the context) if it is reflexive, symmetric, and 


transitive. 


——————————————————————————————— 


An equivalence relation allows one to connect those elements of a set that 
have a particular property in common. One can show that each of the relations 
below is an equivalence relation: 


1. The relation ~ on the set Z defined by m~ 7 if and only if m—~n is 
even. 

2. The relation ~ on the set R defined by & ~ y if and only if |a| = |y| . 

3. The relation ~ on the set Z defined by m~n iff 3 evenly divides 


(m—n). 


A partition of a set breaks up the set into disjoint and nonempty subsets. In 
other words, a partition of a set A is a set of nonempty disjoint subsets of 4 
whose union is equal to 4. Figure 3.1 portrays a set 4 that is broken up into 
five subsets X,Y,U,V,W. Thus, P= {X,Y,U,V,W} is a partition of A 
because every element of A is in one of the nonempty sets X,Y,U,V,W and 
any two of these sets are disjoint. The following definition formalizes this notion 
of breaking up a set into disjoint parts. 


Figure 3.1. A partition of the set 4. 


Definition 3.2.13. Let A bea set. Let P be a set of nonempty subsets of 4 . We say 
that P isa partition of A _ ifthe following two conditions hold: 


1. For every element q € A, thereisaset S € P suchthatq Ee S. 
2. Forall § and J’ in P,if S sf T thn SNT = OS. 


Item 1 of Definition 3.2.13 asserts that every element in 4 belongs to a set in 
the partition P. Item 2 states that any two different sets in P are disjoint. When 
this occurs we say that the sets in P are pairwise disjoint. Thus, a partition of a 
set A divides the set into nonoverlapping parts that, together, cover all of 4. 


The main result that we will soon establish is that an equivalence relation on a 
set 4 induces a partition of 4. Therefore, an equivalence relation can be used 
to create a new set from an old one. For each q € A, we must first form the set 
of all those elements in 4 that are related to a. 


Definition 3.2.14. Let ~~ be an equivalence relation on 4 . Let @ be anelementin A . 


The equivalence class of @ , denoted by (al ~,, is the set of all elements in 4 that are 


related to @, namely, [a]. = {a € A:a~a}. 


By the subset axiom, an equivalence class is a set. In Definition 3.2.14, we 
shall write |a| = [a]. whenever the relation ~ is understood. 


Example 5. Let ~ be the equivalence relation on JR defined by « ~ y if and 


only if |a|—=|y|. Then the equivalence classes |1], |—1], and [2] are 
(lj ={z#eER:a~1}={reER: |2| = |1|} = {1,-1}, 
(-1]={zE€R:a2~-—1} = {x ER: |2| = |-1|} = {-1, 1}, 


(2]} = {# ER: 2~2}= {2 ER: |a| = |2|} = {2, -2}. 


Hence, |1] = [—1] and [1] # [2]. In addition, observe that [1] M [2] = o . 


Remark 3.2.15. Let ~ be an equivalence relation on 4 and let @ € A. Then 
[a] C A, and moreover x € [a] if andonlyifaw~a. 


Example 6. Let ~ be the equivalence relation on 7 defined by m ~ n if and 
only if m —n is even. Thus, [k] = {mE€Z:m~k}={meEZ: m—k is 
even } , for each integer 4. We evaluate the equivalence classes {1|, [2], and 
(3] as follows: 
(1] = {m €Z:m-—1 is even} = {..., —5, —3, -1,1,3,5,7,...}. 


(2} = {m € Z: m— 2 is even} = {..., —6, —4, —2,0,2,4,6,8,... }. 
[3] = {m € Z: m— 3 is even} = {..., —5, —3, -1,1,3,5,7,... }. 


Therefore, [1] = [3] and [3] 4 [2]. Furthermore, [3] [2] =o. 


For any equivalence relation, our next theorem shows that two elements are 
related if and only if their equivalence classes are equal. 


Theorem 3.2.16. Let ~ be an equivalence relation on 4 . Then for all @€ A 
andbhe A, @~ b if and only if [a] = |b]. 


Proof. Suppose that ~ is an equivalence relationon 4.Leta@é A andbe A 
. We shall prove that q ~ if and only if |a| = {b|. Assume qa ~ } . We prove 
that [a] = |b]; that is, we prove that these two sets are equal. Let x € [a]. We 
shall show that a € |b]. Since ax € |al, it follows that « ~ a. By assumption, 
we also have that gq ~ ) . Thus, # ~ a and q ~ f). Because ~ is transitive, we 
conclude that a ~ ) and hence, x € |b|. So, [a] C {b]. A very similar argument 
shows that {b] C Ja]. Therefore, [a] = |b]. To prove the converse, assume 
[a] = [b]. Since a € [a], we see that a € [b]. Hence, a~h.O 


We now present a useful corollary. 


Corollary 3.2.17. Let ~ be an equivalence relation on a set A. For any a and 
b in A, we have that a € [b| if and only if [a] = |b}. 


Let ~ be an equivalence relation on a set 4. Theorem 2.1.3 implies that the 
collection of all equivalence classes {x : (da € A)(ax = |a|)} is a set. This set 
is denoted by {|a] : a € A}. In our next theorem, we show that the set of all 
equivalence classes forms a partition of A . Thus, an equivalence relation on a 
set can be used to break up the set into nonempty subsets that do not overlap. 


Theorem 3.2.18 (Fundamental Theorem of Equivalence Relations). Let ~ 
be an equivalence relation on 4 . Then the set P = {{a]: a € A} isa partition 
of A. 


Proof. Let ~ be an equivalence relation on A. We shall prove that the 
collection P = {{a] : a € A} isa partition of A . To do this, we show that (1) 
for every element « € A, we have that x € {a}, and (2) for all x € A and 
y €A, if [x] F [y], then [2] MN [y] = @ . For (1), let ze € A. Clearly, [a] € P 
and x € |x|, because ~ is reflexive. Now, to prove (2), letxE A, yEA, 
and assume |x| M |y| 4 @ (we are using proof by contraposition). Thus, there is 
aze€A such that z € |2| and z € [y]. Therefore, [z] = [a] and [z| = [y] by 
Corollary 3.2.17. So [a] = [y]. 0 


Definition 3.2.19. Let ~ be an equivalence relation on A . Then A / ., denotes the 


partition {[a] [ae A} of A and is called the quotient set induced by ~ . 


An equivalence relation ~ on A breaks up the set A into disjoint subsets, as 


17. TF 
illustrated in the following figure: [a] [x] [c] [a] --- 


Mathematicians frequently use the quotient set A/. to learn something new 
about the set 4 itself. 


Example 7. Consider the equivalence relation ~ on 7 defined by m~ n if 
and only if (m— mn) is evenly divisible by 3. For all integers m and 7, it 


follows that m~ n if and only if m = 3k +n for some k € Z. Therefore, 
[pn] ={meEZ:m~n}={38k+n:k EZ} 


for each integer mn. We can now evaluate the equivalence classes {()|, {1], and 
[0] = {8k:k EZ} ={...,-9, —6, —3,0,3,6,9,...}. 


[1] = {8k +1:k EZ} ={...,-8,—5,—-2,1,4,7,10,... }. 
(2): (2) = {83k +2:k EZ} ={...,-7,-4,-1,2,5,8,11,...}. 


The quotient set Z/. is {{n|:nm€ Z} = {0}, [1], [2|} and is illustrated in the 
figure: 


[0] (1) (2) 


Exercises 3.2 


*1. Explain why the empty set is a relation. [Hint: Suppose @ is not a relation and 


derive a contradiction. ] 
2. Prove items 1—3 of Theorem 3.2.7. 
3. Let J? bearelation. Prove thatif 4 C 3, then R{A] sa RB]. 
4. Let R be the relation defined by 


R= £10, 1), (0, 25,42; 0), (2,3); 44,4). 428).18, S40 9)11 3) 43, 2h). 


Evaluate dom(R), ran(R), RoR, R| {1}. Ro] {1}. RL], 
and R~"({1}]. 


5. Suppose that R is a relation. Prove that 
R | (A U B) = (R | A) U (R | B) for any sets A and f}. 

6. Let /? be arelation. Prove that fld( R) = UU U i 

*7. Let #2 and § be two relations and let A , J}, and (’ be sets. Prove that 
R{[A,.R "STR. RC}, and RoS aresets. 

*8. Let J? be a relation and G bea set. Prove that { R/C] -Ce G} is a set. 
Prove that if G is nonempty, then { R [C | “CE G} is also nonempty. 

9. Let fF be a relation on 4. Suppose that dom(R) — A and 
R-10oR C FR. Prove that JR is reflexive on A . 

10. Let #2 bearelationon A . Prove that J? is symmetric if and only if Ro! a ae 
11. Let JR bearelationon A . Prove that J is transitive ifandonlyif Ro RC R 


12. Let #2 bearelationon A . Prove that /? is symmetric and transitive if and only if 
R1oR=R- 

13. Arelation #2 on A is said to be antisymmetric if, for all g € A and y € A. 
whenever (x, y) € R and (y, L) € R, then © = Y. Prove that a relation FR 
on A is antisymmetric if and only if Ro R~! C J (see page 44). 

14. Let G be a nonempty set of transitive relations on A . Prove that the relation () G 
is transitive. 

15. Let J? bearelation on A . Prove that if J? is an equivalence relation on A , then 
R- 1 is also an equivalence relation on A . 

16. Let /? be an equivalence relation on A . Prove that Ro R= RR. 

17. Let #2 and § be relations on 4 . Suppose that /? is reflexive on A . Prove that 
SC HoS andSCSoR. 

18. Let #2 and § be a reflexive relations on A . Suppose that /? is also transitive. 
Prove S C FR ifandonlyif SoR=R. 

* 19. Prove item (2) of Theorem 3.2.8 (see Exercise Notes). 

* 20. Prove item (4) of Theorem 3.2.8. Observe that y € OL RIC] -CeE G} iff 
ye RIC] forevery C'E G. 

“21. Prove item (5) of Theorem 3.2.8. 

22. Let A? and § be single-rooted relations. Prove that ff o §’ is single-rooted. 

23. Let #2 and § be relations. If dom( R) = ran(S) and Ro S’ is single- 
rooted, then prove that S’ is single-rooted. 

24. Let P bea partition of a set 4 . Now define the relation ~ on A as follows: 
x ~ y if and only if z € C and y € C for some C € P. Prove that 
e~ isan equivalence relation on A . 


Exercise Notes: For Exercise 19, note that ye J{RIC]:C eg} iff 
y € RC] for some CegG. and 


ye RG] iff Gr eJ9) (x,y) ER 
iff Sa(a € L¢ A (x,y) € R) 
iff Sa(AC € G)(x ECA (x,y) € R) 


iff (AC € G)Aa(a EC A (x,y) € R), 


recalling the quantifier interchange law 1.3.4(1). For Exercise 20, note that 


ye Rif) G| iff (Ar € (19) (z,y) ER 
iff Sa(a € ()¢ A (x,y) € R) 
iff Sa(VC € G)(x# ECA (2, y) € R) 
=> (VC € G)ar(x# €E CA (2, y) € R), 


recalling the quantifier interchange law 1.3.4(3). For Exercise 21, observe that 
y € R[A] \ RIB] iff y € RIA] Ay ¢ RIB] 
iff (ax € A) (z,y) Ee RA-7A(Ar € B){z,y) ER 
=> (dr € A\ B) (2, y) € R. 


Exercise 24 and Theorem 3.2.18 connect the seemingly unrelated concepts of an 
equivalence relation on aset A anda partitionon A. 


3.3 Functions 


One of the most important ideas in modern mathematics is the concept of a 
function. Moreover, the function concept is one that is used extensively outside 
of mathematics. A function is a way of associating each element of a set 4 with 
exactly one element in another set 8. For example, f(x) x” is a function 
that associates each real number 2 with the real number ,-2. The functions 
studied in a first calculus course have the set JR of real numbers, or subsets of 
R , as domain and codomain. In this chapter, we will look at functions in a more 
general context and will examine some important properties that a function may 
possess. To do this, we must first give a precise set-theoretic definition of a 
function. 


Definition 3.3.1. A relation J? is single-valued if for each x2 € dom( R) there is 
exactly one ¥ such that ig, y) ER. 


Po 


Thus, if R is a single-valued relation, then whenever (x,y) € R and 
(a,z) € R,wecan conclude that y = 2. 


Definition 3.3.2. A function / is any single-valued relation; that is, for each 


LE dom(F’) there is only one ¥ such that iz. y) Ee F’. 


Since a function is also a relation, all of the results and exercises presented in 
Section 3.2 (excluding equivalence relations) apply to functions as well. 


Let A and # be sets. A function F from A to PB is asubset of Ax B 
such that for each x € A there is exactly one y € B so that (2, y) € F. We 
now express this notion in terms of a formal definition. 


Definition 3.3.3. Let A and #3 be sets, and let # be arelation from 4 to J}. Then 
F’ is said to be a function from 4 to /3 ifthe following two conditions hold: 


(1) dom(F’) A ; that is, for each x € A, there is a ye Bssuch that 


(2, y) € F. 
(2) F is single-valued; that is, if (x, y) € FF and (x, z) EF ,theny—2z 


The set A is the domain of J’ andthe set /} is called the codomain of /’. 


Example 1. Let A= {a,b,c,d,e} and B= {5,6,7,8,9}. Then 
F = {(a,8) , (b, 7) , (c, 9) , (d, 6) , (e, 5)} 


is a function from A to # because for each y € A, there is exactly one 
y € B such that (2, y) € F’. On the other hand, the set of ordered pairs 


G = {(a,8) , (b, 7) , (c, 9) , (d, 6) , (b, 8) , (e, 5) } 


is not a function from 4 to B because (b,7) € g and (b,8) € g, but 7/8. 
Hence, item (2) of Definition 3.3.3 fails to hold. In addition, the set 


H = {(a, 8), (b, 7) , (c, 9) , (e, 5)} 


is not a function from 4 to RB because qd € A and there isno y € B such that 
(d,y) € H . So item (1) of Definition 3.3.3 does not hold. 


We write #’': A — B to denote that Ff is a function from the set A to the 
set 3. Thus, for each a € A, there is exactly one y € B such that (x,y) € F 
. This unique ¥ is called “the value of # at a” and is denoted by F(x). 
Therefore, (a,y) € F if and only if F(a) = y. The value F’(x) is called “F 
of x ” or “the image of 2 under /’.” In addition, one can say that a € A is an 
input to the function / and that F’(2) is the resulting output. Moreover, one 
can also say that the function / maps x to F(a), denoted by «++ F(x). 


When we are given a function /’: 4 —+ 8B, we know that each x € A is 
mapped to exactly one element F(a) in B , because /’ is single-valued. 


Remark 3.3.4. Technically speaking, when F’ is a set of ordered pairs, one can 
use the notation /’(a) only when it is known, or it is clear, that /’ is a function. 


Consider the set of ordered pairs F = {(x,y) €ERxR:y=22°+1}. 
One can easily show that F’ satisfies the conditions of Definition 3.3.2. Thus, 
is a function, and we can write F(a) 22% +1, foralz eR. 


When A and § are finite sets, then a function -#': 4 -+ B can be 


represented by drawing an arrow from each element z¢€A_ to the 
corresponding element F(a) € B (see Figure 3.2). Such a drawing is called an 
arrow diagram. These diagrams can help one to gain a better understanding of 
the function concept and the concepts of domain and codomain. 


Figure 3.2. Arrow diagram of the function given in Example 2. 


Example 2. Let A= {a,b,c,d} and B= {1,2,3,4,5}. Consider the 
function F; A — B defined by F = {(a, 3) , (b,5) , (ec, 1) , (d, 3)} . Thus, 


Fia)=3, F(b)=5, F(c)=1, and Fld) =. 


Since A and § are finite, we can illustrate the function / by means of the 
arrow diagram in Figure 3.2. 


Clearly, each element » € A is mapped to exactly one element /’(2) in B. 
Observe that a # d and F(a) = Fd). So it is possible for distinct elements in 
the domain to produce the same value under a function. In fact, many functions 
have this “repeated value” property. Such a function is not single-rooted. 


If # and G are relations, then / = G if andonly if / and G have exactly 
the same ordered pairs. The following lemma offers a useful tool for showing 
that two functions are equal. 


Lemma 3.3.5. Let # and G be functions such that dom(F’) = dom(G). Then 
F = G ifand only if F(a) = G(x) for all x in their common domain. 


Proof. See Exercise 1. 
Let A and B be sets. Consider the class 
{F’: F is a function from A to B}. Since a function fF: A+B is a 


subset of A x B, we conclude that every such function is an element in 
P(A x B); it thereby follows from Theorem 2.1.3 that this class is a set. 


Definition 3.3.6. Let 4 and /} be sets. The set of all functions from A to J} , denoted 


by AB , is defined by “B = {F': F is a function from A to B}. 


Some authors use the notation 4 for the set of all functions from A to B. 
The notation 43 is commonly used in set theory because the set 4 is read 


before #3 , emphasizing the fact that the set consists of all functions of the form 
f: AcoB. 


Example 3. Let w = {0,1,2,3,...} be the set of natural numbers. 


1. “{0,1} is the set of all functions f: w—> {0,1}. One can think of 
“{Q,1} as the set of all infinite binary sequences. 

2. “yw is the set of all functions f: w— w.One can think of “w as the set 
of all infinite sequences of natural numbers. 


Remark 3.3.7. The empty function is the function whose domain is the empty 
set. For each set #3, there is exactly one function f: @ —- B. Since 


f ¢ © x B and the product @ x B = @, this unique function f is just the 
empty set. In other words, the empty set © vacuously satisfies the condition 
given in Definition 3.3.2. Thus, @ is a function from @ to #, for any set PB. 
Hence, in Definition 3.3.6, “B = {@} for every set B, and 4g — g@ when 


A is nonempty. 


3.3.1 Operations on Functions 


Let /’ be a function. Since fF’ is a single-valued relation, we have that 
(x,y) € F if and only if F(a) = y. Therefore, we can slightly modify 
Definition 3.2.6 to obtain the following definition. 


Definition 3.3.8. Let /’ and (,' be functions. 
(1) The inverse of )’ is the relation F {(v, u) (u, v) a hi 


Thus, (y,2) € F-! iff (z,y) € F iff F(x) = y. 


(2) For each A, the restriction of f' to A is the function 


F\/ A= {(u,v): (u,v) € FAue A} 


with domain dom(F’) - A . Therefore, (2, y) EFA iff F(z) yAxEA 


(3) For any A C dom(F’), the image of 4 under F is the subset of ran(/’) 


sivenby F|A] = {F(a): 2 € A}, 


where {F'(a) : a2 € A} is equal to the set {y: y = F(x) for some x € A}. 
Hence, yE€ FA] iff (Qae€ A)(y=F(x)). So, if weA, then 
F(a) € FA]. 


(4) For aset #8, the inverse image of #8 under F is the subset of dom(/’) 
defined by ¥ '(B] = {u: F(u) € B}. 


Therefore, « € F~"[B] iff F(x) € B. 


(5) The composition of fF and G is the _ relation 
FoG = {{u,v) : dt((u,t) € GA (t,v) € F)}. 


Consequently, (x,y) € FoG iff 3t({z,t) E GA (t,y) € F). 


Remark 3.3.9. We make a few notes concerning Definition 3.3.8. 


(a) Let # be a function. In general, f~-1 is not a function, but it is a 
relation. 

(b) Suppose that /’ is a function and let A C dom(f’). Then F' | A isa 
function with domain A . 

(c) Given functions /’ and G, we will prove below that the composition 
F’oG isa function. 

(d) Asa function F’ is a single-valued relation, the range of F’ is the set 


ran(F’) = {F (x): 2 € dom(F)}. 


Theorem 3.3.10. Let / and G be functions. Then 


(a) / oG isa function. 
(b) dom(F' o G) = {a € dom(G) : G(x) € dom(F)} . 
(c) Forall « € dom(F' o G), we have (F' 0 G)(x) = F(G(a)). 


Proof. Let / and G be functions. We shall prove statements (a), (b), and (c). 


(a) To prove that the relation foG is a function, let « € dom(foG). 
Assume (x,y) € (FoG) and (a,z) € (F oG). We shall prove that y = 2. 
Since (x,y) € (F'oG) and (x, z) € (F' 0G), there are s and ¢ such that 


(x,t) € Gand (t,y) € F, 
(x,s)€ Gand (s,z) € F. 


Hence, 
(x,t) € G and (2,8) €G, (3.5) 
(t,y) € F and (s,z) € F. (3.6) 


Since G is a function, we conclude from (3.5) that ¢ = s. So from (3.6) we 
obtain (t,y) € F and (t,z) € F. Because /’ is a function, we infer that 
=. 


(b) Let w be given. Because G is a function, we see that St((u,¢) € G) if and 
only if (u,G(u))€G. We have the following equivalences: 


u€dom(FoG) iff (uy) € FoG for some y 
iff (u,t) EGA tt, y) EF for some t 
iff (u,G(u)) EGA (G(u),y) E F t = G(u) 
iff w € dom(G) A G(u) € dom(F) definition of domain 


iff w € {a € dom(G) : G(x) € dom(F’)} definition of set. 
Therefore, dom(F’ o G) = {x € dom(G) : G(x) € dom(F’)}. 


(c) Let x€dom(FoG). Since FoG is a function, we see that 
(x,y) € FoG where y = (fF 0 G)(zx). It follows, as in the proof of (b), that 
(G(a),y)€F. Thus, y=F(G(x)), as F is a function. So 
(Fo G)(x) = y= F(G(2)).0 


Remark 33.11. If G:A+B, F:B+C, and H:C+D, 
Theorem 3.3.10 implies that (F'oG): A—C and (FoG)(x) = F(G(a)) 


for all 2 € A andthat where 


forall, (see Theorem 3.2.7(5)). 


3.3.2 One-to-One Functions 


Some functions (see Figure 3.2) may have two inputs that are assigned to the 
same output and thus, such functions have a repeated value. If a function has no 
repeated values, then we will say that the function is one-to-one. For example, it 
is easy to see that the function in Figure 3.3 is one-to-one. 


Figure 3.3. Arrow diagram of a one-to-one function 


Definition 3.3.12. A function is said to be one-to-one (or an injection), if and only if 


any two distinct elements in the domain of are mapped to distinct elements, or 


equivalently 


The following lemma is straightforward to prove (see Exercise 1) and will be 
used to establish an important consequence of our next theorem. 


Lemma 3.3.13. A function is one-to-one if and only if __ is single-rooted. 


Theorem 3.3.14. Let bearelation. Then 


(a) is afunctioniff — is single-rooted. 
(b) isafunctioniff is single-rooted. 


Proof. Since (a) and (b) are essentially the same statement, we just prove (a). 


( ). Assume isa function. Thus, — is single-valued. So to prove that _is 
single-rooted, assume that and .Wemust prove. Since and ,we 
have that and_, by the definition of . Because _ is a function, we infer 
that 


( ). Suppose that is single-rooted. To prove that is a function, assume 
that and .Weprove .Since and ,weconcludethat and_, by 
the definition of . Now, since is single-rooted, we conclude that . LJ 


Corollary 3.3.15. If is a one-to-one function, then is also a one-to-one 
function. 


Proof. Assume that the function _—_is one-to-one. Hence, __ is single-rooted by 
Lemma 3.3.13. Theorem 3.3.14(b) now implies that isafunction. As isa 
function, Theorem 3.3.14(a) allows us to conclude that is single-rooted. 
Therefore, is one-to-one by Lemma 3.3.13. LJ 


Theorem 3.3.16. Let bea one-to-one function. 


(a) If ,then 
(b) If , then 


Proof. Assume __is a one-to-one function. Corollary 3.3.15 implies that isa 
function. Since the proofs of (a) and (b) are similar, we just prove (a). Assume 
. Thus, .Hence, .So ,because isa function. LJ 


We note that the empty function vacuously satisfies Definition 3.3.12. Thus, 
is a one-to-one function from to, for any set 


Definition 3.3.17. A function maps onto when , that is, if (see 
Figure 3.4). Such a function is also referred to as a surjection. 


Figure 3.4. Arrow diagram of afunction _ that maps onto 


Theorem 3.3.18. If is a one-to-one function that maps. onto ,then is 
a one-to-one function that maps onto  ,and 


(a) for all 
(b) for all 


Proof. Let bea one-to-one function that maps. onto .Hence, and 
Since _ is one-to-one, Corollary 3.3.15 implies that —_is a one-to-one function, 
and, furthermore, Theorem 3.2.7 shows that and  .Therefore, is aone-to- 
one function that maps onto  . Theorem 3.3.16 implies items (a) and (b). LJ 


A function that is one-to-one and onto is called a bijection. 


Theorem 3.3.19. If the functions. and are one-to-one, then the composition 
is one-to-one. 


Proof. Suppose the functions and are one-to-one. To prove that the 
function is one-to-one, let and . Assume that . Thus, (i) by 
Remark 3.3.11. Since is one-to-one, we conclude from (i) that . Because 

is one-to-one, we see that . This completes the proof. 


Theorem 3.3.20. If isonto| and  isonto  ,thenthe composition is 
onto 


Proof. Assume  isonto and_ isonto  .Weshall prove that the function 

is onto .Let  . Since is onto and , there isa such that 
Because and  isonto ,thereisan  suchthat . We will show that 
as follows: 


3.3.3 Indexed Sets 


There are occasions when the range of a function is deemed to be more 
important than the function itself. When that is the case, both the 
terminology and the notation undergo radical alterations. 


— Paul R. Halmos [7] 


Let bea function with domain andrange_. To simplify our notation, let 
us write , foreach. It is sometimes useful to identify the function by 
using the indexed notation where is referred to as the index set, an 
element in is called anindex, and each value — of the function at an index 

is called a term. Thus, — will be said to have nonempty terms when __ for 
all . We shall say that — is an indexed function, and its range shall be 
called an indexed set. An indexed set is also referred to as an indexed family of 
sets. Whenever ,then is the restriction of the function to 


When we say that is an indexed set, we just mean that it is the range of the 
indexed function . Indexed sets appear frequently in mathematics. The union 
of an indexed set , written as  , appears routinely in mathematics. We now 
give a formal definition of this union. 


Definition 3.3.21. Let be an indexed set. Define 


that is, 


Thus, — consists of those elements that are in at least one of the sets . For 
example, suppose .Then , and 


Definition 3.3.22. Let be an indexed set where . Define 


that is, 


The intersection is composed of those elements that belong to each and 
every one of the sets 


3.3.4 The Axiom of Choice 


Suppose that we have aset that contains only nonempty sets. Is it possible to 
uniformly select exactly one element from each set in ? In other words, is 
there a function so that for each , we have that ? The following set 
theoretic principle will allow to answer this question. 


Axiom of Choice. Let — be an indexed function with nonempty terms. Then 
there is an indexed function suchthat , forall 


Definition 3.3.23. Let be an indexed function with nonempty terms. An indexed 


function shall be called a choice function for _, if for all 


Let beafunction where forall  . Suppose that there exists an explicit 
method for identifying a single element in _—_and that this method works for 
each . Then one can construct a choice function without appealing to the 
axiom of choice. Moreover, if is a finite set, then one can prove that there is a 
choice function (by induction) without using the axiom of choice (see 
Exercise 22 on page 116). On the other hand, there are times when the only way 
to obtain a choice function is by appealing to the axiom of choice. 
Mathematicians often use the axiom of choice when the index set is infinite and 
it is not clear how to construct a choice function. 


To better understand the axiom of choice, let us look at three examples where 
we can define a choice function without using the axiom of choice: 


1. is such that each is a nonempty set of natural numbers. For each 

, define to bethe least element in .Then isa choice function 
for. 

Z. is such that each is a nonempty finite interval of real numbers. For 


each ,let be the midpoint of the interval .Then _ isa choice 
function for 

a is such that each _ contains one pair of shoes. Foreach , let be 
the right-footed shoe in’ .Then _ isa desired choice function. 


In each of the above examples, we presented a method for identifying a single 
element in , and this method worked for each . Thus, we were able to 
uniformly select exactly one element from each set without using the axiom 
of choice. 


Now let _ be an infinite set and consider the following three examples where, 
in general, it is not possible to define a choice function: 


i; is such that each _ is an infinite set of real numbers. 
2. is such that each is anonempty set of functions. 
3. is such that each contains one pair of identical socks. 


It is not at all clear, in each of these three examples, how to uniformly select 
exactly one element from each set. . By the axiom of choice, however, there is 
such a choice function. (The socks/shoes examples are due to B. Russell.) 


The Zermelo—Fraenkel system of axioms is usually denoted by _, and the 
axiom of choice is abbreviated by. The axiom of choice was formulated by 
Ernst Zermelo; however, Zermelo and Fraenkel did not include this axiom in 
For this reason, the result of adding the axiom of choice to the system is 
denoted by  . There were some early attempts to prove the axiom of choice 
assuming only the axioms in. Since these attempts were not successful, 
mathematicians began to doubt the possibility of proving the axiom of choice, 
and eventually this was shown to be the case. The combined work of Kurt Gédel 
in 1940 and Paul Cohen in 1963 confirmed that the axiom of choice is 
independent of the Zermelo—Fraenkel axioms; that is, cannot be proven or 
refuted using just the axioms in . Nevertheless, the axiom of choice is a 
powerful tool in mathematics, and some important theorems cannot be 
established without it. Consequently, mathematicians typically assume the axiom 
of choice and usually cite the axiom when they use it in a proof. Our next 
theorem uses the axiom of choice in its proof, and we alert the reader to this fact 
by using the notation (AC) just before the statement of the theorem. 


Suppose that _is a set of nonempty sets. In the proof of our next theorem, we 
convert into an indexed set via a simple change in notation. Using the set 
itself as the index set, together with the identity function on , we shall 
reexpress _aS an indexed set. 


Theorem 3.3.24 (AC). Let be a set of nonempty sets. Then there is a function 
such that _ for all 


Proof. Let bea set of nonempty sets. Let be the identity function on _ ; 
that is, let for all . Let , for each . We can now rewrite as the 
following indexed function . By the axiom of choice, there is an indexed 
function suchthat forall .Define by ,foreach_ . Therefore, 
forevery .U 


The above function shall be called a choice function for . Informally, 
the axiom of choice declares that for any collection of nonempty sets, it is 
possible to uniformly choose exactly one element from each set in the collection. 


The statement of Theorem 3.3.24 actually implies the axiom of choice, and 
thus, the theorem is equivalent to the axiom (see Exercise 21). We will revisit 
the axiom of choice in Chapter 7. 


Exercises 3.3 


1. Prove Lemma 3.3.5 and Lemma 3.3.13. 


2. Let be a function and let| —_. Prove that 
3. Let be a function and let| —_. Prove that 
4. Let be one-to-one, ,and| .Proveif| then 
5. Let be a one-to-one function,) |and| .Provethatif , then 
6. Suppose that is a one-to-one function, and let| . Prove that 
7. Let 5 ,and| .Proveif , then we have that 
8. If isonto) , ,  4,and 4, prove 
9. Suppose that| isafunction. Provethatif and) , then 
10. Let and) befunctionsfrom) (to  .Suppose  . Prove that 
11. Let be a set of functions. Suppose that for all and in| , we have 


either or 
(a) Prove that is a function. 
(b) Suppose that each is one-to-one. Prove that is one-to-one. 


12. Let be a function. Let . Prove that 


13. Assume is onto . Let . Prove that 
14. Let and be functions. Prove that whenever 


15. Let be a one-to-one function. Define by ,foreach|  . Prove that is 
one-to-one. 
16. Let be a function. Define a relation on by ifandonlyif) , for all 
and in. Prove that is an equivalence relation on. Let be given. Describe 
the equivalence class 
17. Let be a function. Suppose that is a reflexive relation on _. Prove that for 
al. 
18. Let beafunction. Prove that} is a symmetric relation on ifandonlyif| — for 
al. 
19. Let be a function. Prove that is a transitive relation on if and only if for 
all 


20. Prove the following theorems: 
(a) Theorem. Let and be indexed sets with the same index set. . Suppose for 


all . Then 

(b) Theorem. Let and be indexed sets with the same index set| . Suppose for 
all . Then 

(c) Theorem. Suppose that is a set and that is an indexed set. Then 

(d) Theorem. Suppose that is a set and that is an indexed set. Then 


(e) Theorem. Suppose that is a set and that is an indexed set. Then 


21. Let be an indexed function with nonempty terms. Prove that there is an 
indexed function sothat forall  , using Theorem 3.3.24. 
22. Let} and _ . Suppose that forall thereisan  sothat  . Using the 
axiom of choice, show that there is an indexed function suchthat and _ , 
for all 


3.4 Order Relations 


The equality relation was generalized, in Section 3.2.3, by the introduction of an 
equivalence relation. An equivalence relation satisfies three key properties that 
are usually associated with the equality relation; namely, such a relation is 
reflexive, symmetric, and transitive. We shall now generalize the concept of 
“less than or equal to.” 


The familiar relation onthe set puts an order on the real numbers. This 
relation is reflexive and transitive; but, since and ___, the relation is not 
symmetric. Moreover, for and in ,if and ,then’ . We therefore 
say that —_is antisymmetric. We now present a general definition. 


Definition 3.4.1. A relation on a set is antisymmetric when 


thatis,if and ,then whenever and_ arein 


Let us now review the three core properties of the relation _ on the set of real 
numbers. For all we have the following: 


is (reflexive). 
2. If and ,then (antisymmetric). 
3. If and ,then (transitive). 


The above three properties are the ones that constitute the notion of order. We 
can now generalize the concept of “less than or equal to” and apply it to many 
different sets. To do this, we shall use the symbol to denote a relation. 


Definition 3.4.2. A relation on a set is a partial order if is reflexive, 


antisymmetric, and transitive; that is, for , and in, the following hold: 


and), then 
and), then 


If isapartial order onthe set, then we shall say that the structure isa 
partially ordered set or a poset.’ For example, _is a poset. 


Problem 1. Let bea set (of sets) and let —_ be the subset relation. Show that 
is a partially ordered set. 


Solution. Let , ,and_ besetsin’ . We verify the following three items: 


(1) __} 
(2) If and ,then 
(3) If and ,then 


Clearly, (1) is valid. If and ,then _ by the extensionality axiom. Thus, (2) 
holds. Finally, item (3) follows from Exercise 2 on page 35. 


Problem 2. For integers) and__, the divisibility relation meansthat for 
some .Let bea set of nonzero natural numbers. Show that _ is a partially 
ordered set. 


Solution. We show that the divisibility relation is reflexive, antisymmetric, and 
transitive on the set .Let bein  . We need to verify the following three 
items: 


(1) |_| 
(2) If and_ ,then 
(3) If and, then 


Since ,Wwehavethat . Items (2) and (3) can also be verified. 


Definition 3.4.3. Let be a partial order on _—.: The relation is a total order (or 


linear order) on if satisfies _; that is, either or), for every and in 


When  isaposetand isa total order, then we shall say that _is a totally 
ordered set. The poset __ is a totally ordered set. On the other hand, let and 
let be the divisibility relation. Then isaposet, butsince and _  ,wesee 
that is not a totally ordered set. 


Definition 3.4.4. Let be a poset. For and in’, we write if and only if 


and. The relation on shall be referred to as the strict order corresponding to 


A strict order is not a partial order, because it is not reflexive. This is verified 


by our next lemma. 


Lemma 3.4.5. Let bea poset, and let _ be the strict order corresponding to 
.Then forall , ,and  in_ , we have the following: 


de eth 

2. If , then 

3. If and ,then 

4. Let beatotal orderon  . Then exactly one of the following holds: 
,or ,or 


Proof. See Exercise 2. LJ 


So, if — is a totally ordered set, then Lemma 3.4.5(4) implies that the strict 
order corresponding to _ satisfies the trichotomy law on __ ;; that is, for all 
and in , exactly one of three alternatives holds: ,  , or 


Let beapartial orderonaset .If , then we will say that is smaller 
than andthat is larger than 


Definition 3.4.6. Let be a partial order onaset| . An element is called a maximal 
element if andonlyif| forall) ;thatis,thereisnoelementin that is larger than 


Definition 3.4.7. Let be a partial order on a set| . An element is said to be a 


minimal element if and only if for all ; that is, there is no element in that is 


smaller than 


Example 3. Let . Thus, are minimal elements in the poset (see 
Problem 2) and = are maximal elements. 


Definition 3.4.8. Let be a partial order on 


e If, (satisfies ,then is called an upper bound for 
e If satisfies , then is called a lower bound for 


Let be the poset given in Example 3 where .Hence, _ isa subset of 
Observe that is anupper bound for _, but there is no lower bound for in 


Definition 3.4.9. Let bea partial order onaset| , and let 


e If is an upper bound for and whenever is another upper bound for, 
then is called the least upper bound for 
e If is a lower bound for and whenever is another lower bound for 


then is called the greatest lower bound for 


Our next lemma shows that whenever a least upper bound for a set exists, it is 
unique. Similarly, if a greatest lower bound exists, then it is unique as well. 


Lemma 3.4.10. Let =beaposeton .Let .If and __ are least upper 
bounds for ,then .If and _ are greatest lower bounds for , then 


Proof. Let and __ be least upper bounds for . Since _ is a least upper 


bound and is another upper bound for _, it follows from the definition of 
least upper bound that . Similarly, it follows that . By antisymmetry, we 
thus conclude that . An analogous argument shows that if and are 
greatest lower bounds for ,then .LI 


Consider the poset. where .Let .So_ isthe least upper bound for  , 
and _ is the greatest lower bound for 


Definition 3.4.11. Let be a partial order on where ,  ,and 


e If and, then is the largest (greatest) element in 
e If and), then is the smallest (least) element in 


There is a simple distinction between an upper bound and a largest element. 
An upper bound for need not be anelementin  .A largest element of is 
just an upper bound that is also an element in _— (see Exercise 4). Similarly, a 
smallest element of is just a lower bound that is also an element in the set 


Lemma 3.4.12. Let bea partial orderon .Let .If and are largest 
elements of ,then .If and  aresmallestelements of , then 


Proof. Let and _ be largest elements of . So, in particular, and are 
both elements in  .Since isalargestelementof and_, it follows that 

. Similarly, it follows that . By antisymmetry, we have that .If = and 
are smallest elements of , an analogous argument shows that. LI 


Example 4. Consider the poset. , where and __eis the divisibility relation. 
Let .Then isthe largestelementin  .Theset has no smallest element; 
however, _ is the greatest lower bound for 


Definition 3.4.13. Let be a poset, and let and be elements in| . Then and 


are said to be comparable if either or ; otherwise they are said to be 


incomparable. 


A partial order on a set = may not be a total order, but it is a total order on 
any subset of in which any two elements are comparable. 


Definition 3.4.14. Let be a partial order onaset| ,andlet' . Then is called a 
chain in if for all and in | either or 


Clearly, if and isachainin’ ,then isalsoachainin 


Example 5. Let and consider the poset , where is the divisibility 
relation. Then the set. isachainin  forwhich _ is an upper bound. The set 
isachainin that has no upper bound. 


Now let bearelationon .Givenany subset of  , wecan also view 
asarelationon (see Exercise 14). 


Definition 3.4.15. Let be arelation on | _, and let . Then the induced relation on 


is defined to be . In other words, 


Definition 3.4.16. Let and be partially ordered sets. A bijection is called an 


isomorphism from onto if satisfies 


forall and in .Ifsuchan isomorphism exists, then and _ are said to 
be isomorphic. 


When two posets are isomorphic, they are structurally the same poset. 


Exercises 3.4 


1. Define a relation on the set of integers by if and only if and is 
even, for all in. Prove that is a partial order on  . Then answer the 
following questions about the poset 

(a) Is achainin ? 

(b) Is achainin ? 

(c) Does the set have a lower bound or an upper bound? 
(d) Does have any maximal or minimal elements? 


2. Prove Lemma 3.4.5. 
3. Find the greatest lower bound of the set in the poset where __. Now find the 


least upper bound of 


4. Let be a poset and let. Suppose that is the largest element of. Prove 
that is also the least upper bound of 
5. Let be a partial order onaset | jandlet| . Suppose and are both greatest 
lower bounds of _. Prove that 
6. Let be the set of all subsets of the set, . Thus, is a partially ordered set, where 


Find an upper bound, the least upper bound, a lower bound, and the greatest lower bound of 
the following subsets of 


(a) 
(b) 
(c) 
(d) 


(e) 
(f) 


7. Let beapartial orderonaset andlet .Suppose and __ are both 

smallest elements of . Prove that 

8. Consider the poset , where ;thatis, isthe set of all subsets of . Let 
bethe chainin defined by 


Does have an upper bound? A least upper bound? 


9. Consider the poset where ;thatis, is the set of all finite subsets of 
.Let bethechainin defined by 


Does have an upper bound? A least upper bound? 


10. Let and __ be posets. Suppose that the function satisfies — if and 
onlyif ,forall and in  .Provethat is one-to-one. 

11. Let be a poset. Suppose is a one-to-one function. Define the 
relation on by ifandonlyif ,forall and in  .Provethat is 
a partial order on 

12. Let be a totally ordered set. Suppose is one-to-one. Define the 
relation on by ifandonlyif ,forall and in  .Provethat is 
a total order on 
13. Let beaposet and let .Suppose that is the smallest element of 
Prove that is also the greatest lower bound of 

14. Let beapartial orderon .Let .Showthat _ isa partial order on 

. Show thatif isatotalorderon ,then isa total order on 
15. Let beaposet.Foreach ,let .Let bedefinedby .Thus, isa 
poset. Prove that 


forall and in .Hence,theposets and _ are isomorphic. 


16. Let and _ beposets. Suppose that _ satisfies 


forall and in .Let_ . Prove the following: 


(a) If isachainin’ ,thentheimage isachainin 
(b) If isachainin ,then  isachainin 
(c) If has anupper bound, then _ has an upper bound. 
(d) If hasanupperboundand  isonto  ,then _ has an upper bound. 
17. Let beapartialorderon  . Show that 
18. Let beaset where each isa partial order on its field. Suppose that 
forany and in ,either’ or 


(a) Showthatforevery and in ,if  ,then 
(b) Let bethe relation withfield . Prove that is a partial order 
on its field. 


19. Let and _ beposets. Define the relation on by 


forall and in. .Therelation _ is called the lexicographic ordering on 


(a) Provethat is a partial ordering on 
(b) Suppose that isatotal orderon andthat isa total order on 
Prove that _is a total order on 


3.5 Congruence and Preorder 


Let be an equivalence relation ona set . Recalling Definition 3.2.19, the 
quotient set denotes the partition of inducedby  . By Theorem 3.2.16, 
we know thatif ,then .Let bea function. Canwesaythatif ,then ? 
In this section, we will pursue such questions. 


An equivalence relation that is preserved under certain identified operations is 
called a congruence. In this case, one can use these operations to construct 
similar operations on the quotient set and thereby address the above question. 


A preorder is a relation that is reflexive and transitive. Thus, a preorder is 
“almost” a partial order. We will also show how one can take a preorder on a set 


and then use it to construct a partial order on a particular quotient set. 


Congruence 


Again, let. be an equivalence relationon . We now investigate the following 
question: Given a function ,canweuse' todefineafunctionfrom to ? 
Specifically, (| )is there afunction — such that 


Is it obvious that such a function _—_ exists? Given an equivalence class, is the 
resulting value of independent of the element — chosen from the equivalence 
class? Suppose that .Thus, by Theorem 3.2.16.If ,then .So would 
not be single-valued, and hence, will not be a function. To avoid this 
difficulty, we must have that 


Definition 3.5.1. Let be an equivalence relation on _, and let be a function. We 


say that) is congruent with if for all and), 


We can now answer question (_). 


Theorem 3.5.2. Let. be an equivalence relationon andlet_. There exists 
afunction — such that 


ifandonlyif is congruent with 
Proof. Let bean equivalence relationon and let 


( ). Let besuchthat forall .Let and  besuchthat .So _ by 
Theorem 3.2.16. Since is a function, we conclude that . Therefore,  , 
again by Theorem 3.2.16. 


( ). Suppose is congruent with .Let . To prove that the relation is 
single-valued, let and .Assumethat .Weshowthat as follows: 


Hence, is a function. Clearly, the domain of equals and_. Therefore, 


and forall .U 


We shall soon show how Theorem 3.5.2 can be extended to both relations and 
binary operations. 


Definition 3.5.3. Any function of the form is called a binary operation on (or just a 


binary operation whenever the context is clear). We shall write for the value when 
and 


Definition 3.5.1 above describes when a given function is “in harmony” with 
an equivalence relation. We now define what it means for a binary operation, 
and a relation, to be in harmony with an equivalence relation. 


Definition 3.5.4. Let be an equivalence relation on and be a binary operation. 


Then __ is congruent with ifforall in 


’ 


Definition 3.5.5. Let be an equivalence relation on | _, and let be a relation on 


Then __ is congruent with ifforall in 


’ 


One can easily adapt the proof of Theorem 3.5.2 to establish our next two 
theorems (see Exercises 2 and 3, respectively). The following theorem extends 


Theorem 3.5.2 to binary operations. 


Theorem 3.5.6. Let — be an equivalence relation on 
afunction — such that 


ifandonlyif is congruent with 


We now extend Theorem 3.5.2 to relations. 


Theorem 3.5.7. Let — be an equivalence relation on 
on .Thereexistsarelation on __ such that 


ifandonly if is congruent with 


Preorder 


Definition 3.5.8. A relation on a set is a preorder if 


that is, forany  ,  ,and in, the following hold: 


and), then 


and let . There exists 


,and let be a relation 


is reflexive and transitive; 


When isa preorder on the set , we shall say that is a preordered set 
or a proset. Of course, if a preorder is also antisymmetric, then it is a partial 


order. On the other hand, there are preorders that are not partial orders. 


Example 1.Let and .Definetherelation on by 


forall and  ,where denotes the number of elements in. One can easily 
see that the relation __ is reflexive and transitive; that is, is a preorder on 
Clearly, and .Since ,weseethat is notantisymmetric.So isnota 
partial order on 


Lemma 3.5.9. Let be a preordered set, and let be the relation on 
defined by 


Then _ is an equivalence relation on 
Proof. See Exercise 4. LJ 


The relation in Lemma 3.5.9 is called the derived equivalence relation. 


Lemma 3.5.10. Suppose that is a preordered set. Let be the derived 
equivalence relationon .Then is congruent with _ ; that is, for all for all 
in ,if and 4,then _ if andonlyif 


Proof. See Exercise 5. LJ 
When _ is a preordered set, Lemma 3.5.10 allows us to define a partial order 


on the quotient set. when __is the derived equivalence relation. 


Theorem 3.5.11. Suppose is a preordered set, and let be the derived 


equivalence relationon .Considerthe relation on defined by 


(3.7) 


forall and in .Then isa partial order on 


Proof. Let bea proset, and let —_—be the derived equivalence relation. By 
Lemma 3.5.10, is congruent with . Theorem 3.5.7 implies that there is a 
relation satisfying (3.7). Since is a preorder on __—, one can now show 
(using (3.7)) that isapartial orderon (see Exercise 6). L] 


Definition 3.5.12. Let be a preorder on. Then is a total preorder on if 


satisfies the property: __; that is, either or| , for any and in 


Corollary 3.5.13. Suppose that. _is a preordered set. Let __ be the relation on 
defined by (3.7) where _ is the derived equivalence relationon .If isa 
total preorderon ,then isa total order on 


Proof. See Exercise 7. LJ 


Exercises 3.5 


1. Let be an equivalence relationon  _. Let be defined by 


Show that the function is one-to-one if and only if each equivalence class in 
is a singleton. 


. Prove Theorem 3.5.6. 
. Prove Theorem 3.5.7. 
Prove Lemma 3.5.9. 

. Prove Lemma 3.5.10. 

. Prove Theorem 3.5.11. 


7. Prove Corollary 3.5.13. 
8. Let  . Define the equivalence relation on by 


au Sp WN 


For example, and 


(a) Is there an so that forall, ? 
(b) Is there an such that forall, ? 
(c) Is there an such that forall, ? 


9. Let be the set of natural numbers and let be the standard order relation on 


. Let) . Consider the relation on defined by 


(a) Show that not antisymmetric. 
(b) Prove that is a preorder on 
(c) Prove that is a total preorder on 


(d) Let be the derived equivalence relation on _. Is there a function such 
that ? 
(e) Define by. Now show that is congruent with 


Exercise Notes: Exercise 9 offers a first step in a set-theoretic construction of 
the integers using the natural numbers (see Remark 4.4.15). 


4 
The Natural Numbers 


In this chapter, it shall be shown that number theory can be embedded within set 
theory. One consequence of this embedding is that all of the theorems in number 
theory can be proven from the axioms of set theory. 


In order to show that number theory is, in fact, a special branch of set theory, 
we must first represent each natural number as a set. How can one define the 
natural numbers in set theory? To answer this question, we next consider a set 
theoretic construction that makes sense for any set. 


Definition 4.0.1. For each set 7 , the successor ».* is the set that is obtained by adjoining 


Z tothe elements of  , namely, g:* ru { x} : 


We note the following three properties concerning the successor of a set «x : 


1. aca” iff (@e€xVa=2). 
2. x€ at. 
3. eg. 


Using the successor operation, we can now construct, in set theory, the first 
few natural numbers as follows: 


e 0=0. 
*1l=0 OU {O} = {0}. 
© 2Z=1' Lwtit = 76,1}. 


© 3=2t+ =2uU {2} = {0,1,2}. 
© 4= 3+ = 3U {3} = {0,1,2,3}. 


We note some interesting properties that these “natural numbers” possess (see 
the above bulleted list): 


1. SPE LSRVESerene.... 
2. SCCILSSCCSCWCES xix: 


4.1 Inductive Sets 


The existence of an infinite set is crucial in modern set theory and mathematics. 
Zermelo realized that one cannot prove that an infinite set exists and thus found 
a fairly simple way to assert the existence of an infinite set. 


Infinity Axiom. There is a set A that contains the empty set as an element and 
whenever x € A, thenyt Ec A. 


Using the infinity axiom, we shall prove that there is a set consisting of only 
the natural numbers; this set is denoted by w . First, we define a property of a set 
that will ensure that each natural number belongs to such a set. A set that 
satisfies this property will contain the empty set and will be closed under the 
successor operation. 


Definition 4.1.1. A set / is said to be inductive if and only if 


1@e€Ef7,2. (Va Ee I)(a re [), that is, J is “closed under successor.” 


An inductive set contains @ and also contains the successor of each one of its 


elements. Hence, an inductive set must contain all of the “natural numbers.” 
Clearly, the infinity axiom declares that there is an inductive set. 


Definition 4.1.2. A natural number is a set that belongs, as an element, to every inductive 


set. 


Therefore, 2 is a natural number if and only if x is in every inductive set. 
We now want to prove that there is a set consisting of just the natural numbers. 
To do this, we need to prove that the class 


{x : x is in every inductive set} 
(4.1) 


is a set. By the infinity axiom, we know that there is an inductive set 4 . Hence, 
for each x, if a is in every inductive set, then g € A. Theorem 2.1.3 therefore 
implies that (4.1) defines a set, and this set consists of the natural numbers. 


Definition 4.1.3. The set consisting of the natural numbers is denoted by Ww. 


Of course, w is the set {0,1,2,3,...} , but we will identify w as the set that 
satisfies 


x € w if and only if x is in every inductive set. 
(4.2) 


Theorem 4.1.4. The set w is inductive and w is a subset of every inductive set. 
Thus, w is the smallest inductive set. 


Proof. We first prove that w is an inductive set; that is, we shall prove that 

(1) ©€w,(2) (VaeE w)(a* Ew). 
Since @ is in every inductive set, (4.2) implies that @ € w. To prove (2), let 
a € w. We thus conclude that a is in every inductive set. So q* is also in 
every inductive set. Hence, q* € y and (2) holds. Now let 7 be an inductive 


set and let « € w. The above (4.2) implies that x is in every inductive set. 
Therefore, 7 € J and thus, w C/.U 


We now prove the following simple corollary. 


Corollary 4.1.5. Let J C w be an inductive set. Then J = w. 


Proof. Let / be an inductive set such that / C w. Since J is an inductive set, 
Theorem 4.1.4 implies that w C J.Hence, J = w.U 


Corollary 4.1.5 is, in fact, a restatement of the principle of mathematical 
induction. Suppose that P(n) is some property. To prove by induction that 


(Vn € w)P(n) 


is true, let J = {n € w: P(n)} andso, J C w. If we can prove that 


(1) OE€7,(02) Wnew)(nerlon* el), 


then / is inductive and thus, ~—w by Corollary 4.1.5. Therefore, 
(¥n € w)P(n) holds. Item (1) is often called the base step, and (2) is 
commonly refereed to as the inductive step. We apply this method of proof to 
establish our next theorem. 


Theorem 4.1.6. For every n € w, either » = 0 or yn — &*+ forsome k € w. 


Proof. Let J = {né€w:n=O0V (ak €w)(n=k*)}. Clearly, OE J. Let 
n€ [. So either n = 0 or n — kt for some & € w. Therefore, p+ — (+ or 
nt =(k*)* where kt €w. Thus, n+ € J. Hence, by Corollary 4.1.5, 
T=w.U 


Definition 4.1.7. A set 4_ is said to be a transitive set if (Va A) (a c: A) ; that is, 


every member of 4 is alsoasubsetof A . 


Therefore, for any set A we have that 
A is transitive iff (Va)(Vz)|((a@ €aAa€ A) 42@EAl (4.3) 


if | JACA. (4.4) 


Example 1. The empty set © is, vacuously, transitive. Each of the following 
sets is also transitive: 


° 1=0* (= {0}), 

« 2=1* (= {0,1}). 

e $= 3h = 20,1,21). 

« d= SF (= 10,1,2. 8} i. 
e 5= 4+ (= {0,1,2,3, 4}). 


On the other hand, the set A = {{0}} is not transitive because {0} € A and 
yet {O} Z A. Similarly, the set {0, 1,3} is not transitive. 


Remark 4.1.8. Regrettably, we are now using the word “transitive” to refer to 


two different concepts. A relation ona set A is transitive provided that for 
ala, ¥,z in A,if «Ry and yRz, then xRz (see Definition 3.2.11). A set 
A is transitive, if whenever x € A, then 2 C A.Ifaset A is transitive, then 
one cannot infer that the membership relation € on A is a transitive relation; 
that is, one cannot always conclude that if « € y and y € z, then x € z, forall 
Z,¥.2mJ4. 


Proposition 4.1.9. Let 4 and B be sets. Then | )(AU B) = (UJ) A)U(UB). 
Theorem 4.1.10. Let a bea transitive set. Then U(a" =7: 


Proof. Suppose that a is a transitive set. Then 


LJ(a*) = (a U {a}) by def. successor 
LU aU | {a} by Proposition 4.1.9 
LU aUa because | J{a} =a 


=¢@ as Ja C a by (4.4). 


Therefore, | )(a*) =a.U 


Theorem 4.1.11. Every natural number is a transitive set. 


Proof. We prove that (Vn € w)(n is a transitive set) by induction. Let 
I={n€w:nisa transitive set}. 


Clearly, 0 € J, as 0 = © is a transitive set. Let » € J. So n is transitive. To 
prove that ,,+ is transitive, we will show that [Jn* Cn* (see 4.4). By 


Theorem 4.1.10, [Jn* =n. Since nC n* (see item 3 on page 83), we 
conclude that U n* Cn. Hence, -* is a transitive set and thus, n+ € /. By 
Corollary 4.1.5, we conclude that J = w, and this completes the proof. 


Theorem 4.1.11 implies that if @ € h and } € n, then a € n, whenever 7 is 
a natural number. We now prove that the successor function on W is one-to-one. 


Theorem 4.1.12. For all m and n inw,if p+ — m*t,thenn =m. 


Proof. Let m € w and n € w. It follows from Theorem 4.1.11 that m and n 
are transitive sets. Assume that p+ — y+. Thus, |)n* =|)m”. Therefore, 


n =m by Theorem 4.1.10. UJ 


Theorem 4.1.13. The set w is a transitive set. 


Proof. We shall prove that (Yn € w)(m C w) by mathematical induction. Let 
I={new:nCw}. Clearly, 0€ J, because 0O= OG and OG Cw. Let 
neée].So (IH)! n Cw. We shall prove that n* C w. To do this, let & € n*. 
We must prove that k€@w. Since k€n* =nU{n}, we conclude that 


ken ok Nh: If ken, then kew by (IH). If k Nh then kew 
because n € w. Hence, n+ € ]. Therefore, J — w. U 


Exercises 4.1 


1. Let J and .J be inductive sets. Prove that J ( .J is also inductive. 

* 2. Prove that if A. isa transitive set, then Aw is also a transitive set. 

3. Prove that A isa transitive set if and only if A . P( A) : 

4. Prove that if 4 is a transitive set, then U A isa transitive set. 

5. Let 4 be a nonempty set such that /} is a transitive set, for all B € A. 
Prove that () A and 'S) A are transitive sets. 


. Prove Proposition 4.1.9. 
. Prove that 72 f- n* forall nm Ew. 


6 

7 

8. Conclude from Exercise 7 that 77,7 Z n foralm Ew. 

9. Prove that for all 72 € W andall 2 € w,ifm En, then n Z mM. 

* 10. Conclude from Exercise 9 that 72 g n forall 2 Ew. 

11. Let A bea set and suppose that L A A . Prove that A is transitive and 
for all ¢ € A ,thereisa y € A such that 2 € y. 


Exercise Notes: For Exercise 7, apply Theorem 4.1.12 for the inductive step. For 
Exercise 9, let 772 € w and let / {n Ew: (m Enon Z m)} . Prove 
that J is inductive. Assume 7, € [.Toprove 7»* € J let En*.SomeEn 
or 7 n.Ifm € n, then 7» € J implies that - Z m. and thus, 727 Z mM . 
If m n., then 7” Z m. by Exercise 8. 


4.2 The Recursion Theorem on w 


Induction is frequently applied in mathematical proofs. Moreover, one often 
defines a function by induction (recursion). A recursively defined function is one 
that is defined in terms of “previously evaluated values of the function.” A 
function ), on w& is defined recursively if its value at () is first specified and all 
of the remaining values are then defined using an earlier value, together with a 
given function. For example, let f: A-— A be a function. Suppose that the 
initial value of a function f is given to be h(0) = a fora fixed q € A, and to 
get the next value h(1) we use the initial value to obtain 
h(1) = f(h(0)) = f(a). Similarly, to get the next value h(2) we must 
evaluate f(h(1)). By continuing in this manner, we can evaluate more and more 
values of the function / . A more succinct way of describing this function /, is 
to first define (i) h(O) =a and then define the remaining values by (ii) 
h(n*) = f(h(n)). Thus, one can evaluate h(0), h(1), ..., h(100). Just 


because we can evaluate a finite number of values, do we really know that there 
is such a function / with domain w? The existence of such a function seems 
quite reasonable. In mathematics, however, the acceptance of an existential 
Statement requires a valid argument, that is, a proof. The proof of our next 
theorem shows that such a function does exist and that it is unique. 


Recursion Theorem 4.2.1. Let 4 be a set and q@¢€ A. Suppose that 
f: A— A isa function. Then there exists a unique function ): w —+ A such 
that 


(1) h(O) =a,(2) h(n*) = f(h(n)), forall n Ew. 


The function /, in the recursion theorem, satisfying (1) and (2), is said to be 
defined by recursion. If a function has been defined by recursion, then proofs of 
statements about this function often use “proof by induction.” Before we prove 
the recursion theorem, we make an observation. First, let us assume that the 
function /, exists. When viewing /, as a set of ordered pairs, conditions (1) and 
(2) imply that 


e (0, a) € h, and 
e if (n,u) Eh, then (n*, f(u)) Eh, foralln ew. 


In our proof, we will show that there exists a function /, that satisfies the above 
bulleted conditions. 


Proof. Let A beaset, @€ A, and let f: A— A bea function. Let us call a 
relation R C w x A suitable if the following two conditions hold: 

(i) (,a)ER. 

(ii) If (n,u) € R, then (n*, f(u)) € R. 


Our goal is to construct a suitable relation that is also a function. 


Since every suitable relation is a subset of w x A , Theorem 2.1.3 implies that 
the clas S={R: Ris suitable} is, in fact, a set. Because wx A is a 
suitable relation, § is nonempty. Let h a S.Clearly,h Cw x A,andso h 
is a relation. We now prove that /) is suitable and that it is a function. 


Claim 1. /, is suitable. 


Proof. We need to show that / satisfies items (i) and (ii). Every relation in § 
is suitable. So (0,a) is an element in every relation in §. Thus, (0,a) €()S; 
that is, (0,a) € h. To prove (ii), assume that (n,u) Eh. As h=[)]S, it 
follows that (m,u) belongs to every relation in S. Let RES. Since 
(n,u) € R and R is suitable, item (ii) implies that (n*, f(w)) € R. So 
(n*, f(w)) belongs to every relation in §, and hence, (n*, f(w)) € h. Thus, 
h is suitable. (Claim1) U 


We now must prove that /, is a function from w to A. 


Claim 2. }, is a function from w to A. 


Proof. To prove that /, is a function from w to 4 , we must show that for each 
n€w, there is exactly one y € A_ such that (n,y) Eh. Let TCw be 
defined by 
I = {n €w: there is exactly one u € A such that (n,u) € h}. 


We now prove that J is an inductive set. To do this, we must first prove that 
0 € J. We know that (0,a) € h . To prove that 0 € J, we need to show there 
is no (0,b) € h where a 4b. Suppose, for a contradiction, that (0,b) € h 


where a ¢ b. Consider the relation /¢ = / \ {(0, 6)}. 


Clearly, (a) (0,6) € R. As h is suitable, it easily follows that R is also 
suitable (because n* # () for all n € w). Hence, RES. Since h (|S; we 
conclude that hCR, and thus, (0,b)€ R, which contradicts (a). 
Consequently, a is the only element in A such that (0,a) € h . Therefore, 
OE]. 


Let mn € J]; that is, assume that (IH) there is a unique 4% € A_ such that 
(n,u) € h . Because /; is suitable, we conclude that (n*, f(u)) € h. To show 


that m* € 7, we must show there is no (n*,b)€h where f(u) #b. 
Assume, for a contradiction, that (n*,b) € h where f(u) # 6. Consider the 


relation R=h\ {(n* ,b)}. 


Clearly, (@) (n* ,b) € R. We now show that R is suitable. Since f is 
suitable and n* #0), we have only to show that if (n,v) € R, then 
(n*, f(v)) € R. To do this, assume that (n,v) € R. Thus, (n,v) € h, and 
so (n*, f(v)) Eh. Since (n,u) Eh, (n,v) Eh, and n € J, we conclude 
from (IH) that w = v, and thus, f(w) = f(v). Because f(v) = f(u) Ab and 
(n*, f(v)) €h, we infer that (n*,f(v)) € R. So R is suitable. Hence, 
ReES. Since h={)S, we conclude that h C R, and thus, (n*,b) € R, 
which contradicts (@). Therefore, f(w) is the only element in A such that 
(n*, f(u)) € h.So y+ € J, and thus, / is inductive. Corollary 4.1.5 implies 
that J — w, andasaresult, ) is afunction from w to 4.(Claim2) OU 


Because the function /, is suitable, it satisfies conditions (1) and (2) given in 
the statement of the theorem. To prove that f/ is unique, suppose that 
g:w-— A also satisfies properties (1) and (2). Thus, g is a suitable relation, 
and hence, g € S. Since h={)S, we have that h Cg. Exercise 10 on 
page 68 implies that h = g. Therefore, ) is unique. (Theorem) LU 


Example 1. Let f: R — R and let @ = 2. Theorem 4.2.1 implies that there is 
a function h:w—R such that h(0)=2 and h(n*)= f(h(n)), for all 
new. 


When can we be assured that the function /, in Theorem 4.2.1 is one-to-one? 
Our next theorem provides an answer. 


Theorem 4.2.2. Let 4 be aset, aE A, a¢ran(f), and f: A+A bea 
function. Suppose that ): w — A satisfies 


(1) h(0) =a, (2) h(n*t) = f(h(n)), foralln Ew. 


If f is one-to-one, then /, is also one-to-one. 


Proof. Let {: A—-A_ be one-to-one, ae A, a¢ran(f), and let 
h:w— A Satisfy (1) and (2). We prove that for all nm € w andall k € w, if 
h(n)=h(k), then yn—=k. Let ICw be defined by 
T={né€w: (Vk €w)(h(n) = h(k) 3 n=k)}. 


We shall show that J is an inductive set. To prove that 0 € J, we must show 
that 


(Vk € w)(h(0) = h(k) + 0 =k). 
(4.5) 


So let k € w and assume that h(0) = h(k). By (1), we have h(0) = a. Hence, 
h(k) =a. Suppose that k #0. Then &=j* for some j €w by 
Theorem 4.1.6. Since h{k) = a, we conclude that h(j*) = a. The above item 
(2) now implies that f(h(j)) =a. Thus, a € ran(f), which contradicts the 
fact that a ¢ ran(f). Hence, & = 0. Therefore, (4.5) holds, and so ( € J. 


Let meée/J. In other words, assume the _ induction hypothesis 
(Vk E€ w)(h(n) = h(k) + n=k). (TH) 


We shall prove that n»*¢€/J/; that is, we will prove _ that 
(Vk €w)(h(n*) = h(k) 3 n* =k). 


Let k € w and suppose that (A) h(n*) = h(k). Because n* 4 0, the above 
(4.5) and (&) imply that k #0. So k=j* for some 7 €w. Hence, 
h(n*) = h(j*). Item (2) implies that f(h(n)) = f(h(j)). Since f is one-to- 
one, we conclude that h(m) = A(j). Our induction hypothesis (IH) implies that 
n= Jj, and thus, n* = 7°; that is, p+ — &. Hence, n+ € J, andso J is an 
inductive set. Therefore, J — y and / isa one-to-one function. LJ 


4.2.1 The Peano Postulates? 


The Peano postulates offer an axiomatic foundation for the natural numbers. 
These postulates were first presented by the nineteenth-century 
Italian mathematician Giuseppe Peano. In 1888, Richard Dedekind proposed a 
set of axioms about the natural numbers. Peano then published a more precisely 
formulated version of these axioms in his 1889 book The principles of arithmetic 
presented by a new method [13]. The essence of the Peano postulates shall be 
summarized by means of the following two definitions. 


Definition 4.2.3. Let S$: NM’ —s NV and let A C JN. Then A is said to be closed 


under § if forall w, if a € A , then S(a) CA. 


Let (a, y,z) denote an ordered triple, that is, (2, y,z) = ({a,y),z). 


Definition 4.2.4. Let (N 9, ¢ ) be an ordered triple that consists of a set /V , a function 
S: N — WN, anda particular element ¢ € /V . Then (N 5.€ ) is a Peano system 


if the following three conditions hold: 


(1) e €ran(S). 
(2) § is one-to-one. 
(3) Forall A C N,ife € A and A isclosed under §', then A 


The above condition (3) is referred to as the induction postulate. Figure 4.1 
illustrates a Peano system. 


e S(e) S(S(e)) s(e) ste) 


e—> @ ——-> 6 —-_> 6 —--> 6 —-> 6 —-—- ©: :; 


Figure 4.1. A Peano system (V, S,e) 


Theorem 4.1.4 shows that w is inductive. Thus, we can define the function 
g:w—-+w by a(n)=n* for all nm € w. We can now prove that a Peano 


system exists. 


Theorem 4.2.5. The triple (w,a,0) is a Peano system. 


Proof. Since w is inductive, we have that 0 € w and that o(m) € w for all 
né€w. Clearly, n+ 40 for all n€w, as nent and n¢O. Thus, 
0 ¢ ran(o). Theorem 4.1.12 shows that o is one-to-one. Finally, we need to 
show that (w,o,0) satisfies the induction postulate. Let A C w be such that 
QO¢A and A is closed under 0. So A Cw is inductive. Corollary 4.1.5 
implies that 4 — yu. Therefore, (w,oa,() is a Peano system because it satisfies 
conditions (1)-(3) of Definition 4.2.4. LJ 


Definition 4.2.6. Let ( N.S. € ) and ( Ni OS! o be Peano systems. Then 
(N, S,¢ ) is isomorphic to (N a ad Jf when there is a bijection 7: \¥V —s NV’ 
such that 


(1) w(e) =e’, and(Q2) 7(S(x)) = S'(x(x)) forall gE N- 


The function 7 is said to be an isomorphism from (/V,S,e) onto (N’,S’,e’). 


Two Peano systems are isomorphic if they are essentially the same structure. 
We now prove that (w,a,0) is isomorphic to any Peano system. Thus, the set of 
natural numbers w is “the same” as the set NJ) that is used in calculus. 


Theorem 4.2.7. Let (V,S,e) be a Peano system. Then (w,a,0) is isomorphic 
to (N, S,e). 


Proof. Let (V,S,e) be a Peano system. So S$; N -+ N is one-to-one, e € N, 
and e €ran(S$). Since o(m) =n*, Theorem 4.2.1 implies that there is a 
function 7: w —+ N such that 


(1) (0) =e,(2) x(a(n)) = S(x(n)), forall n Ew. 


As e €ran(S$) and § is one-to-one, Theorem 4.2.2 implies that 7 is one-to- 
one. To show that mw is onto MN, we use the induction postulate 
(Definition 4.2.4(3)). Clearly, ran(a7) C N. Since 7(0) =e, we see that 
e€ran(z). Let «€ran(z). Thus, 7(n)=2 for some n€w. Hence, 
S(x(n)) = S(a). By (2), we conclude that 7(a(n)) = r(n*) = S(x). So 
S(a) € ran(7), and ran(7) is thereby closed under §. The induction 
postulate implies that ran(7) = N, that is, 7 is onto VY. Therefore, 7 is an 
isomorphism from (w,a,0) onto (NV,S,e).0 


Theorem 4.2.7 implies that the Peano postulates characterize the system of 
natural numbers (up to isomorphism). Theorem 4.2.7 also implies that any two 
Peano systems are isomorphic to each other (see Exercises 7—8). 


Exercises 4.2 


1. How would one modify the proof of Theorem 4.2.1 in order to establish the 
following: 


Theorem. Let 4 be a set and let g@€ A. Suppose that 
Z :w > A—-> A is a function. Then there exists a unique function 


he: w —- A such that 


(1) h(O) =a.(2) h(n*) = f(n, h(n)), forall n € w. 


2. Let @eE A and f: A — A be a function such that q & ran( by }- 
Suppose that fy: 4» —s A. satisfies 


(1) h(O) =a.) h(n*) = f(R(n)), forall n € w. 


Assume that f/ is one-to-one and onto A . Prove that f is one-to-one. 


3. Let A beasetandlet q € A . Suppose that f : A = A satisfies 
(a) a€ f(a) ,(b) if & € Y then f(x) Ec f(y), for all 2 and Yin A. 


Let fp: uy —s A be as stated in Theorem 4.2.1. Prove that h(n) E h(n*) for all 
new. 


4. Let F: A + A bea function and let yE A. 


(a) Prove that the clas S={B: BCA, y€ B, and F|B| C B} isa 
set. 

(b) Show that § is nonempty. 

(c) Let C =()]S. Prove that yeE C and FIC] CC. 

(d) Prove that forall BC A,if ye B and F|B)C B,thn CCB. 

(e) Prove that y € F'|C| if andonly if F|C] =C. 


5. Let #: A + A bea function andlet Y C A. 


(a) Prove that the class S= {B:Y C BC A and F|B] C B} isaset. 
(b) Show that S is nonempty. 

(c) Let C =(]S. Prove that Y CC and FIC] CC. 

(d) Prove that forall BC A,if Y C B and F|B|) CB,thn CCB. 
(e) Prove that Y C F'|C| if and only if F|C] =C. 


6. Let fF: A- A _ be a function and let Y C A. The recursion theorem 
implies the existence of the function h: w — P(A) that satisfies: 


(1) h(0) = Y,(2) h(n*) = h(n) U F{h(n)], for alln Ew. 


Now address the following: 


(a) Let Y C BCA and F{B| C B. Prove by induction that h(n) C B for 
alnéew. 

(b) Let C’ = Une, h(m). Prove that Y CC’ C A and F[C’) CC’. 

(c) Prove forall BC A,if Y C B and F{B|) C B, then c’ CB. 

(d) Let @ be as in the previous Exercise 5. Prove that ©’ — (. 


“7. Let (N,S,e) and (N’,S‘,e’) be Peano systems. Let 7 be an 
isomorphism from (V,S,e) onto (N’,S’,e’). Prove that 7-1! is thus an 
isomorphism from (N’,S’,e’) onto (N,S,e). 

*8. Let (N,S,e) and (N’,S’,e’) be Peano systems. Let 7 be an 
isomorphism from (V,S,e) onto (w,oa,0), and let P be an isomorphism from 
(w,a,0) onto (N’,S‘,e’). Show that the composition poO7 is an 
isomorphism from (V,S,e) onto (N’,S’,e’). 


Exercise Notes: Exercises 4—5 do not involve Theorem 4.2.1, but they do apply 
some of the ideas in its proof. For Exercise 4(e) and for Exercise 5(e), in the 
direction (=> ), prove that /’|C'] € S. For Exercise 6, item (a) implies (c); to 
prove (d), use (b) and (c) together with the results of Exercise 5(c)(d) to show 
that C’ CC andccc". 


4.3 Arithmetic on w 


In this section, the recursion theorem will be used to define the operations of 
addition, multiplication, and exponentiation on the natural numbers. We will 
also prove the properties of arithmetic that we were taught as children. We first 
establish the following result. 


Theorem 4.3.1. Let g: w — w and f: w x w — w be functions. Then there is 
a unique function A: w x w— w such that foreach m Ew, 


(1) h(m,0) = g(m), (2) h(m,n*) = f(h(m,n),m), for all n € w. 


Proof. For each m € w, by the recursion theorem (viewing 77 as a constant), 
there is a unique function Pm: “ — W such that 


(a) Pm(0) = g(m), (0) pm(n*) = f(pm(n), m), forall n € w. 


Since Pm is unique for each m € w, the connection 72 ++ Pym, thus produces a 
function with domain w. We can now define the function h: w x w—-+ w by 
h(m,n) = pm (nm). The function f, easily satisfies conditions (1) and (2) for all 
m € w, and it is unique. L] 


The function /, in Theorem 4.3.1 is defined in terms of two known functions. 
Thus, before one can apply Theorem 4.3.1, one must first introduce the two 
relevant functions 9: ¥™ — W and f:wxw—-w. 


Recalling Definition 3.5.3, a function of the form h: B x B + B iscalleda 
binary operation on # . Our next definition is an application of Theorem 4.3.1; 
that is, using the functions g(m) =m and f(k,m) = k*, we define addition 
on the natural numbers. 


Definition 4.3.2. Let A: w * Ww —> w be the unique function satisfying 


(1) A(m,0) =m,() A(m,n*) = A(m,n)* 


for all natural numbers m and mn. The function A shall be referred to as 


addition, and we define the binary operation + on w tobe 7” +” A(m, n) 


forallm andn inw. 


Theorem 4.3.3. For all natural numbers 7m and 7, 


(Al) m+0=m,(A2) m+n (m4 n)*. 


Proof. Let ™ and n be elements in w. Recalling that A(m,n) = m+n, we 
see that (1) and (2) in Definition 4.3.2 immediately produce (A1) and (A2). O 


The next proposition shows that mm + 1] is equal to 7,+, the successor of m . 
After we prove this proposition, we will show that 2 4+ 2 = 4. 


Proposition 4.3.4. For all m € w,m-+1=m?*. 


m+1=m+0° 
(m+0)* by (A2) 
_— mt a, 
Proof. Let m € w. Since 1 — 9+, we obtain eo by (Al) 
and thus, m+ 1 = m*.U 
We can now prove that 2+2=4 as follows: 


ore e2t i" because 2 = 1* 
=(2+1)" by (A2) 


er by Prop. 4.3.4 
=a as2t =3 
=4 as 3° = 4, 


Now that we have the operation -+ of addition, we can apply Theorem 4.3.1 
to define multiplication on the natural numbers using the functions g(m) = 0 
and f(k,m)=k+m. 


Definition 4.3.5. Let Vf: w * Ww —> w be the unique function satisfying 
(1) M(m,0) =0,2 M(m,n*) = M(m,n) +m 


cc | 


for all natural numbers m and n. The function jf is called multiplication, 


and we define the binary operation - on w tobe 7°” = M (m,m) 


forallm andn inw. 


Theorem 4.3.6. For all natural numbers 7m and mn, 


(M1) m-0=0 
(M2) m-n+=m-n+m.- 


Proof. Let m and n be elements in w. Recalling that (m,n) = m-n, we 
see that (1) and (2) in Definition 4.3.5 yields (M1) and (M2). LJ 


Having presented the set-theoretic definitions of addition and multiplication, 
we will now show that some of the familiar laws of arithmetic are provable 
within set theory. Afterwards, we shall define exponentiation. 


Theorem 4.3.7 (Associative Law for Addition). For all m, nm, and P inw, 
m+(n+p)=(m+n)+>p. 


Proof. Let m and n be elements in w, and _ let 
I={pew:m+(n+p)=(m+n) +p}. 
We shall show that / is an inductive set. To show that 0 € / , observe that 
m+(n+0)=m+n by (Al) 
(m+n)+0_ by (Al) 


and thus, m+ (n+0) = (m-+n)+0, and so 0€/. Assume that pe J. 
Then 


m+(n+pt)=m+(n+p)* by (A2) 

(m+(n+p))* by (A2) 
=((m+n)+p)" since pel 

=(m+n)+p" by (A2). 


Therefore, m + (n+ p*) = (m+n) + p*, andhence, p* € J.U 


For n € w, we know that n + 0 = n by (A1). Since we have not proven that 
addition is commutative, we cannot immediately infer that 0 4+ m — mn, and thus, 
we must prove that this latter equation holds for all n € w. 


Lemma 4.3.8. For all n € w, we have) +n=n. 


Proof. We prove that (VnEw)(0+n=n)_ by induction. Let 
T={néew:0+n=n}. 


We prove that / is inductive. Since (A1) implies that 0 + 0 = O, we have that 
Qe ].Letne].So0+n=n. We prove that 0 + n* = n* as follows: 
O+nt =(0+n)* by (A2) 


— a because n € J. 


Therefore, 7+ € J, and this completes the proof. L] 


Item (A2) states that m+ n* = (m+ n)* for natural numbers m and n. 
We will now prove that we also have that m* +- n (m+ n)*. Then we will 
be able to prove that addition is commutative. 


Lemma 4.3.9. For all m € w andn € w, we have m* +n=(m-+n)”. 


Proof. Let mew be arbitrary. We prove that 
(Yn € w)(m* +n = (m+ n)*) by induction. Let 


IT={n€w:m™+n=(m+n)*}. 


To prove the lemma, we just need to show that / is inductive. First, we verify 
that 0 € J by showing that m* + 0 = (m+ 0)” as follows: 


m*+0=m' by (A1) 


=(m+0)* by (A1). 


Thus, Oef. Suppose that nef. Then 
m*+nt =(m* +n)t by (A2) 


((m+n)")* because n € I 


=(m+n")* by (A2) 


and hence, m* +n* = (m-+n*)*. Therefore, »+ € J and J is inductive. 
O 


Lemmas 4.3.8 and 4.3.9 will be used to prove that addition is commutative. 


Theorem 4.3.10 (Commutative Law for Addition). For all m and nm inw, 
mtn=n+m. 


Proof. Let n be any element in w and let / {mew:m+n=n+m}. 


We prove that / is an inductive set. Observe that 04+n—=n-+0 by 


Lemma 4.3.8 and (Al). So Q¢€/J. Assume that me/J. Then 
m*+n=(m+n)* by Lemma 4.3.9 


(n } m) "since m € I 


=n+m' by (A2). 
Hence, m+ +n —=n-+ mt, and therefore, m+ € 7.U 


Using proof by induction, we will now show that the familiar distributive 
law of arithmetic holds. We will thus have additional evidence to support the 
assertion “Mathematics can be embedded in set theory.” 


Theorem 4.3.11 (Distributive Law). For all natural numbers m, n, and Pp, 
m:-(n+p)=m-n+m-p. 


Proof. Let n and m be in Ww. Now let 
I={pew:m-(n+p)=m-n+m-p}. 
We will show that / is an inductive set. To verify that 0 € J, we have 
m-(n+0)=m-n by (A1) 
m-n+0 by (A1) 
=m-n+m-0_ by (M1). 


Thus, m-(n +0) =m-n-+m-0, and so (0 € J. Now, suppose that p € J. 
Then 


m:(n+p*)=m-(n+p)* by (A2) 
m-(n+p)+m by (M2) 


| 


=(m-n+m-p)+m aspel 


| 


=m-:n+(m-p+m)_ by Theorem 4.3.7 
=m-n+m-p* by (M2). 


Hence, m+ +n—=n-+mt, and thus, m+ € 7.U 


Our next two theorems verify two fundamental properties of multiplication. 


Theorem 4.3.12 (Associative Law for Multiplication). For all m,n, and p in 
wy. m-(n-p)=(m-n)-p. 


Proof. Let m and mn be elements in wW and let 
I={peéw:m-(n-p)=(m-n)-p}. 
We shall show that / is an inductive set. To show that () € / , observe that 
m-(n-0)=m-0 by (M1) 
=.0 by (M1) 
=(m-n)-0_ by (M1) 


and thus, m-(n-0) = (m-n)-0,andso 0 € J. Assume that p € J. Then 


m-(n-p+n) by (M2) 


m:(n-p*) 


m-(n-p)+m-n_ by Theorem 4.3.11 


=(m-n)-p+m-n_ since peé I 
— (m . n) . ot by (M2). 


Therefore, m-(n-p*)=(m-n)-p*,andhence, pt € 7.0 


Theorem 4.3.13 (Commutative Law for Multiplication). For all m and nm in w 
m-n=n-mM. 


’ 


Proof. See Exercise 6. LJ 


We will now apply Theorem 4.3.1 to define exponentiation on the natural 
numbers, using the functions g(m) = 1 and f(k,m) =k-m. 


Definition 4.3.14. Let fy): uw * uw —+ w be the unique function satisfying 


Q) E(m,0) = 1,2) E(m,n*) = E(m,n)-m 


for all natural numbers ™ and nm. The function / is called exponentiation. 


re 7 
We define the binary operation, denoted by m”, to be ”” E(m,n) 


forall m andn inw. 


Theorem 4.3.15. For all natural numbers ™ and 7, 


(E1) m° 1,(E2) mr’ m”™-m-: 


Exercises 4.3 


1, Let mEw and me€w. Suppose m+n=—Q(. Prove that 


m= 8 =|. 
2. Let 7 € w and nm € w. Show that if yx - n Qj, then yp Q) or 


n = 0. 
3. Let m2 € W and nm € W. Prove thatforal DE wW,ifm+p=n+p 
,then 72 — nN. 


4. Prove that for all 72 € uw, the equality () . 7» () holds. 
5. Prove forall 72 € Ww and 7m € Ww, we have that 7777 - 7 m-n+n- 
*6. Let 72 © w. Using the previous two exercises, prove Theorem 4.3.13 by 
showing that the set / {m Ew:m-n Nn - m} is inductive. 
7. A natural number 7 is even if nmn=2-k for some few. If 
n 2-7-+ | forsome 2 € w, then 7 is odd. Prove that for all n © w 
, either 72 is even or 72 is odd. 

8. Let J = {n €w: 7(n is even and n is odd)} . Prove that J is 
an inductive set. Conclude that no natural number is both even and odd. 


9. Prove forall 772, 72, and je in W, that m tk — m”. mi . 


k 


10. Prove for all 772, 72, and ke in w, that (m . nk "1" 3 nk. 


11. Prove for all 772, 7, and /& in WwW, that (m”)* mmk 


Exercise Notes: For Exercise 1, suppose 7? =f () and then use Theorem 4.1.6. For 


Exercise 2, assume that 7 sf () and use Exercise 1 to prove that 777 Q). For 


Exercise 3, Theorem 4.1.12 can be used in the inductive step. 


4.4 Order on w 


We now want to develop a theory of order on the natural numbers. First, we 
define the “less than” relation on the natural numbers. Then we prove that this 
relation satisfies the trichotomy law (see Theorem 4.4.9) as well as the standard 
connections with addition and multiplication. How can we get such a relation in 
set theory? 


Recall that for each nm € w, we defined the successor of 7 to be the natural 
number n~ nU{n}, and thus, y € n+. As a result, we have the following 


infinite membership list: 9 € | € ZESE4E-::-, 


which is very much like the usual relation < on the natural numbers; that is, 


It turns out that the order relation that we need is just “€ ,” and, in fact, we will 
prove that this is the “less than” relation that we were looking for. 


Definition 4.4.1. We say that 772 is less than 7? if and only if 772 € 7, whenever 772 


and 72 are in &). 


In this section, we will view “€ ” as the “less than” relation on the natural 
numbers, and we shall no longer use the symbol < in our initial discussion of 
ordering on w. We will show that the relation € on w satisfies the trichotomy 
law, and we will show that € is preserved under the operations of addition and 
nonzero multiplication. 


Theorem 4.1.11 and Theorem 4.1.13 assert, respectively, that each natural 
number 7 is a transitive set and that w is a transitive set. Hence, we have the 


following result. 


Proposition 4.4.2. Let n be a natural number. Then 


(1) if ken, thn kCn; (2) if a€k and ken, then a€n; (3) if 
kew,thnkCw;(4) ifaeEk andkew,thnaew. 


Item (2) proclaims that € is a transitive relation on w. One can also prove, 
without using the regularity axiom, that no natural number is “less than” itself. 


Lemma 4.4.3. Let n € w. Then n ¢ n. 


Proof. See Exercise 10 on page 88. LI 


Using Definition 4.4.1, we can now easily define the “less than or equal” 
relation on the natural numbers. 


Definition 4.4.4. We write 77 € 7 if and only if 72 € nm or M 7 , whenever 77? 


and 72 are in &. 


Since € is a transitive relation on w, it thus follows that € is also a 
transitive relation on w. Clearly, € is reflexive. We will now show that it is 
antisymmetric (see Definition 3.4.1). 


Lemma 4.4.5. Let m € w andne€w.If men andn€m,thenm=n. 


Proof. Let m € w andn € w satisfy m En and n€m. Thus, m Cn and 
n C m by Proposition 4.4.2(1). Therefore, m =n. LI 


Hence, € is a partial order on w. Is the relation € a total order on w? To 
show that it is, we shall prove that the relation € satisfies the trichotomy law on 
Ww. First, we establish two lemmas. 


Lemma 4.4.6. For all m € w, we have that 0 € m. 


Proof. We shall prove that the set [={me€w:0Em} 


is inductive. Clearly, 0 € J]. Assume m € J . Hence, 0 € m. So either () = m 


or 0 € m. If 0 =m, then, as m € mt, we see that 0 € m+. If 0 € m, then 
we have () € m and yp € m*. By transitivity (see Proposition 4.4.2(2)), we 
conclude that § € m+. Thus, in either case, 0€ m™ and so, mteE]. 
Therefore, / is inductive and J = w.U 


Lemma 4.4.7. For all natural numbers m and n, if m€n, then y,* € n*. 


Proof. Let m€w. We prove that (Vn €w)(meEn—+m* En) by 
induction. Let 


I={né€w:(men+m' En" )}. 


Clearly, 0€ J as (m€0-—-m* €0*) is true vacuously. Let n € J]. To 
prove that »+ € J, assume that » € m+. We shall show that m* € (n*)*. 
Observe that (A) n* € (n*)*. Because m € n*,either or If , then 


as. .Since ,weseethat( )andtransitivity imply that .If , then 
Thus,( )impliesthat .Therefore, .U 


Corollary 4.4.8.If and  arein’ ,then if and only if 


Proof. Assume that and. By Lemma 4.4.7, we just need to prove if , 
then .Solet .Recallthat( ) .Since ,wehaveeither or .If , 
then by( )and transitivity. Also,if ,then( )impliesthat .U 


The following theorem shows that is atotal order on _ ; that is, for all 
and in  ,either or 


Theorem 4.4.9 (Trichotomy Law on). Forall and _ in_ , exactly one of 
the three relationships —__ holds. 


Proof. First, we show that any two of the relations , , cannot hold at the 
same time, whenever and _ are natural numbers. To do this, we just have to 
show that the following two conjunctions do not hold: 


when and . Conjunction (a) implies that , which contradicts 
Lemma 4.4.3. Conjunction (b), by transitivity, also implies that . Hence, both 
(a) and (b) do not hold. 


We now prove that’ .To prove this statement, let —_ and let 


We will show that is inductive. Lemma 4.4.6 implies that .Let _, that is, 
assume that 


We must show that 


(4.6) 


To do this, we consider each of the three cases in (IH) separately. If , then 


by the fact that and transitivity. If , then since . If  , then 
Lemma 4.4.7 implies that . Thus, either or. So in each of these three 
cases in (IH), we obtain one of the conjuncts in (4.6). Therefore, , and the 


proof is complete. L 


The trichotomy law will now allow us to do the following: 


(1) identify an interesting connection between the relations and on the 
natural numbers; 

(2) prove that addition and multiplication on the natural numbers preserve the 
“less than” relation; (3) prove that , the set of natural numbers, satisfies the 
crucial well-ordering principle. 


Corollary 4.4.10.If and arein ,then if andonlyif 


Proof. Let and .Assumethat . Because is transitive, we have that 
Since , Theorem 4.4.9 implies that .So_ . To prove the converse, assume 

. Thus, . From Theorem 4.4.9, we have either or .If  , then we would 
have that ,as_ . This contradicts Theorem 4.4.3. Thus, .U 


Corollary 4.4.10 implies that for all natural numbers and_ , 


When and _ are natural numbers, we can now define __, which is the larger 
of the two natural numbers) and _ (see Exercise 5). 


Our next theorem shows that the expected properties of “inequality” hold for 
the natural numbers. 


Theorem 4.4.11 (Properties of Inequality). For all natural numbers , __, and 
, the following hold: 


(1) if and only if 
(2) If ,then if and only if 


Proof. Item (1) follows from Exercises 6 and 7 of this section. Similarly, (2) is 
established via Exercises 8 and 9. L] 


Corollary 4.4.12 (Cancellation Laws). For all natural numbers , ,and_ 4, 
the following hold: 


(1) If ,then 
(2) If ,then implies 


Proof. See Exercise 10. 


Corollary 4.1.5 (page 85) confirms the fact that the principle of mathematical 


induction is provable from the first seven axioms of Zermelo—Fraenkel (ZF) set 
theory. Virtually every undergraduate text in mathematics (e.g., number theory, 
real analysis, and abstract algebra) just cites this principle and does not attempt 
to prove it directly. Our next theorem shows that these seven axioms also imply 
the well-ordering principle: Every nonempty subset of _ has a least element. 


Theorem 4.4.13 (Well-Ordering Principle). Let. be a nonempty subset of 
Then has a least element; that is, thereisan suchthat forall 


Proof. Let be anonempty subset of . Suppose, for a contradiction, that 
has no least element. Let .So isthe set of that are less than or equal to 
every element in. .Notethat( )  becauseif ,then and  wouldbea 
least element in 


We will now show that _ is inductive. Clearly, , by Lemma 4.4.6. Let ; 
that is, suppose that forall .Since has no least element, it follows that 
. Therefore, forall .Hence, forall by Exercise 3.So  and_ is 
inductive. Thus, and( )nowimplies that . Nevertheless, because and 
is nonempty, we also conclude that . This contradiction completes the 
proof. L 


We end this section with a proof of the strong induction principle on 


Theorem 4.4.14 (Strong Induction Principle on ).Let} beasubsetof and 
suppose that 


(4.7) 


Then 


Proof. Let satisfy (4.7). Suppose, to the contrary, that there is a natural 


number that is not in. . Thus, the set — is nonempty. Theorem 4.4.13 now 
implies that hasaleast element .Hence, andif ,then  . Therefore, 
. As _ Satisfies (4.7), we infer that , a contradiction. LJ 


We can now define a special relation symbol with which we are all familiar. 
Let and bein .Wedefine ifandonlyif ,anddefine if and only 
if .So in the future there may times when we shall use the symbols and , 
rather than and _, if it seems appropriate. 


Remark 4.4.15. Using the system of natural numbers, one can give set- 
theoretic constructions of the set of integers, the set of rational numbers, and 
then the set of real numbers. A method for constructing the set of integers is 
suggested in Exercise 9 on page 82. In [5], Enderton gives a more detailed 
description of this method and constructs the set of rational numbers and the set 
of real numbers. 


Exercises 4.4 


1. Let) . Show that 
2. Let and bein) .Showthatif then 


3. Let and bein) .Showthatif then 
4. Let be inductive and let . Prove that is inductive. 


5. Let and bein, . Suppose that) . Prove that 

6. Let and be natural numbers. Prove that forall) ,if  , then 
7. Let and| |. Provethatforall ,if then 

8. Let and be natural numbers. Prove that forall) ,if  , then 
9. Let and| |. Provethatforall ,if  , then 


10. Use Theorem 4.4.11 and Theorem 4.4.9 to prove Corollary 4.4.12. 
11. Let .Provethat , forall 


12. Let .Provethatforall ,if then for some 
13. Let and) bein .Provethat ifandonlyif forsome 


14. Suppose that satisfies for all 


(a) Prove that is an increasing function; that is, show that for all and 


all) ,if then 
(b) Prove that is one-to-one. 


15. Prove that there is no function satisfying _ for all 

16. Let and let be nonempty. Prove that has a largest element; that is, there 
is an such that) for all 

17. Let) .Provethatforall , the set has a largest element. 

18. Proveforall) ,if| ,then  , forsome 


19. Let bein) .Supposethat . Let 


Prove that is an inductive set. 


20. Let be in with . Assume that where , and where . Prove that 
and) .To prove that, suppose to the contrary that (without loss of generality). 


By Exercise 12, there is a sothat| . Complete the following: 

(a) Show that 

(b) Show that and) .|Thenshowthat , which is a contradiction. Conclude that 
(c) Now show that 


Exercise Notes: For Exercise 3, apply Theorem 4.4.9. For Exercise 4, apply 
Exercise 3. For Exercise 6, use induction. For Exercise 7, apply Exercise 6 and 
Theorem 4.4.9. For Exercise 8, use induction. For Exercise 9, use Exercise 8 and 
Theorem 4.4.9. For Exercise 13, use Exercises 11-12. For Exercise 14, let 
and prove that is inductive. For Exercise 15, one can apply Theorem 4.4.13 
and proof by contradiction: Assume that there is such function .Let  . Let 
be the least element in . Thus, fora. For Exercise 16, use the well- 
ordering principle and Exercise 2. For Exercise 17, use induction and 
review 3.3.8(3) on page 60. For Exercise 18, prove that 


is an inductive set. For Exercise 19, in the inductive step one can use Exercise 3. 
Exercises 19-20 yield a proof the Division Algorithm: Let and _ be natural 
numbers, where. . Then there exist unique natural numbers and such 
that and 


5 
On the Size of Sets 


The size of a finite set can be measured simply by counting; for example, the 
size of the set X = {1,2,3,4,..., 10} is 10 because it has 10 elements, and 
the size of the sets Y = {m,2,30} and Z = {9,11,—1} is 3. Clearly, the 
size of X is bigger than the size of Y. Moreover, the sets Y and Z have the 
same size. Can the concept of “size” be extended to infinite sets? Is there a way, 
which does not involve counting, of showing that two sets have the same size? 
Georg Cantor was the first mathematician to seriously address and answer these 
questions. Cantor found a way to measure the size of any infinite set. He first 
observed that two sets A and # have the same size if there is a one-to-one 
correspondence between A and J3, that is, there is a way of evenly matching 
the elements in 4 with the elements in f . In other words, Cantor observed that 
A and # have the same size if there is a bijection f: A— B. 


The arrow diagram in Figure 5.1 represents a bijection. As a result, we can use 
this bijection to construct the following one-to-one correspondence (5.1) 
between the sets A and B: 


(5.1) 


Hence, the function f allows us to set up a pairing between the elements in 4 
and the elements in 8 such that each element in A is matched with exactly one 
element in #8, and each element in # is thereby matched with exactly one 
element in A. Cantor observed that we can now conclude, without counting, 
that the sets A and # have the same size. 


Figure 5.1. Arrow diagram of a bijection For another illustration, let 
E={k€w:kis even}, and let f: w—- EF bedefined by f(n) = 2n. As 
f isa bijection, we obtain the following one-to-one correspondence between the 
set w of natural numbers and the set #) of even natural numbers: 


we: PF FTF gs 4 F 6 -. Mn 
foo oe a a ok od, Oe OD 
Be @ 246 8 10 2 «en Da 


Therefore, each natural number 7 corresponds to the even number 27, and 
each even natural number 27 is thereby matched with 7 € w. The bijection 
f: w— FE specifies a one-to-one match-up between the elements in w and the 
elements in / . Cantor concluded that the sets w and # have the same size. 


After discovering how to determine if two infinite sets have the same size, 
Cantor proved that the set Q of rational numbers has the same size as the set w 
of natural numbers. As a result, Cantor then conjectured that the set of real 
numbers also has the same size as w. It came as a complete surprise to Georg 
Cantor when, in 1874, he discovered that these two infinite sets have different 
sizes. In fact, Cantor showed that the set of real numbers is much larger than the 
set of natural numbers. This unexpected result would have an enormous impact 
on the future of mathematics. 


When Cantor first presented his new research on the size of infinite sets, some 
of his contemporaries actually refused to acknowledge his discoveries. 
Moreover, Henri Poincaré referred to Cantor’s work as a “grave disease” that 
would infect mathematics. Nevertheless, Cantor and his important ideas would 
eventually be recognized. In 1904, the Royal Society presented Cantor with its 


prestigious Sylvester Medal for his “brilliant” mathematical research. David 
Hilbert, a very influential mathematician, described Cantor’s work as 


the finest product of mathematical genius and one of the supreme 
achievements of purely intellectual human activity. 


In this chapter, we shall investigate Cantor’s early work in set theory. First, 
we will formally define and examine the notion of a finite set, and then we will 
classify sets into two categories: sets whose elements can be enumerated in a 
sequence indexed by the natural numbers (countable sets) and those sets for 
which it is impossible to so enumerate all of their elements (uncountable sets). 
We will also examine Cantor’s fundamental definitions and theorems on the 
cardinality (size) of sets. 


5.1 Finite Sets 


A set is infinite, of course, if it is not finite. Moreover, a set is finite if it has at 
most ” many elements for some natural number 7 . The next definition is just a 
mathematical rewording of this notion. 


Definition 5.1.1. A set is finite if and only if there is a one-to-one function 


Si : X — mn for some natural number 72 . 


It follows, vacuously, that the empty set is finite. Now that we have a formal 
definition of a finite set, we can now use the Well-Ordering Principle 4.4.13 to 
precisely define the notion of the “number of elements” in a finite set. 


Definition 5.1.2. Suppose that Y is a finite set. Let 72 be the least natural number such 


that there is a one-to-one function ‘a - X —+ n.Then 7 is the number of elements in 


the set XX , and we write |X | n. 


In particular, we have that |@| = 0 (see Remark 3.3.7 and the paragraph just 
prior to Definition 3.3.17). 


Theorem 5.1.3. Let 4 be finite and let nm = |A|.If f: A — 7 is one-to-one, 
then f is onto n. 


Proof. Let 4 be a finite set. If 4 —@, then |A| =O and any function 
f: A— 0 is vacuously onto 0. So now assume that A # ©. Let n = | A] and 
let f: A—+ n be one-to-one. Suppose, to the contrary, that f is not onto n. 
Let k € n beso that k ¢ ran(f). Because n # 0, there is an m € w such that 
n=mt. Since k € n= m?*, we have either k =m or ke m. If k=m, 
then f: A—m and f is one-to-one. As m € n, we conclude that |A| € n, 
which is a contradiction. If & € m, then define the function g: A + m by 


k, if fia) = mi: 
g(a) 
f(a), if f(a) #m. 


Since f is one-to-one, it follows that g: A — m is also one-to-one. Moreover, 
because m € n, we conclude that |A| € n, which is again a contradiction. UJ 


Corollary 5.1.4. If A is finite and m= |A|, then there is a_ bijection 
f:Avn. 


Proof. Suppose that A is finite and let nm = |A| . By Definition 5.1.2 there 
exists a one-to-one function f: A — mn. Theorem 5.1.3 implies that f is a 
bijection. LJ 


Theorem 5.1.5. Let f: A — A be one-to-one where 4 isa finite set. Then f 


isonto 4. 


Proof. Let f: A— A be one-to-one where A is finite. Let nm € w be such 
that |A| =n. By Definition 5.1.2, let fh: Asn be one-to-one. 
Theorems 3.3.19 and 5.1.3 imply that (ho f): A — n is a bijection. To prove 
that f isonto 4, let y € A. Since h(y) € n, there exists an 2 € A such that 
(ho f)(x) = h(y). So h(f(a)) = h(y), and thus, f(a) = y as fh is one-to- 
one. Hence, f isonto 4.U 


Corollary 5.1.6. Let f: A— A be one-to-one and not onto 4. Then 4 is 
infinite. 


The following theorem confirms that two finite sets have the same number of 
elements if and only if there exists a one-to-one correspondence between the two 
sets. 


Theorem 5.1.7. Let A and § be finite sets. Then there is a bijection 
h: A—- B ifand only if |A| = |B]. 


Proof. See Exercise 4. LJ 


Recall that n* = {0,1,2,3,...,m} . Since the identity function from n* to 
n* is one-to-one, we see from Definition 5.1.2 that |n*| € n™. 


Theorem 5.1.8. For all natural numbers 7, we have that |n| = n. 


Proof. We prove that J = {n € w: |n| = n} is an inductive set. As |@| = 0, 


we see that |0|=0 and hence, Qe J. Let ne. Thus, |n| =n. Let 
m= |n*| and let F: n+ — m be one-to-one. So m € n*. To prove that 
m = n*, Suppose to the contrary that 7 € 7+. Therefore, (aA) ™ € 7. Since 
F: n* —s m is one-to-one, it clearly follows that (#° | n): n — m is one-to- 
one. As |n| =n, we have that n € m. It now follows from (A) that m= 7. 
Because |n| = n and (F | n) is one-to-one, Theorem 5.1.3 implies that F’ | n 
is onto m. Since F'(n) € m, it follows that F(n) = F(i) for some i En. 
This contradicts the fact that / is one-to-one. Therefore, »,—»* and so 
ntey.U 


Corollary 5.1.9 (Pigeonhole Principle). Let m and nm be natural numbers and 
let f: n — m bea function. If m € n, then f is not one-to-one. 


Theorem 5.1.10. The set w is an infinite set. 


Proof. Suppose w is finite. Thus, there is a one-to-one function F’: yw — n for 
some 7. € w. Since /’ is one-to-one, the restriction (F ck Mis n* — n isalso 
one-to-one, and this contradicts Corollary 5.1.9. 


Corollary 5.1.11. If G: w — A is one-to-one, then A is an infinite set. 


Proof. See Exercise 21. LJ 


The pigeonhole principle implies a “dual” version involving surjections. 


Lemma 5.1.12. If m €n and f: m-—n where m€w and n€ w, then f 
is not onto n. 


Proof. Let mm €mn where nm and m are natural numbers. Suppose, to the 
contrary, that f:m-+n is onto m. We can therefore define the function 
g:n—+m by 


g(x) = the least i € m such that f(i) = a. 
(5.2) 


We now show that g is one-to-one. Let zx €n and y€v7 be such that 
g(x) = g(y). Hence, f(g(a)) = f(g(y)). By the definition (5.2) of 9, we 
have that ®©=—¥Y. So g:n-—-m™ jis one-to-one, which contradicts 
Corollary 5.1.9, because m € n. LI 


Theorem 5.1.13. If NV is finite and f: N — N is onto N, then f is one-to- 
one. 


Proof. Let NV be finite and f: N — N be onto JV. Also, let n € w be such 
that |N| =n. Assume f(a) = f(b) wherea Ee N, bE N,anda¥b. So 
n #0 and yn — kt for some k € w. Let K = N \ {a} . Thus, |K| =k (see 
Exercise 13) and (f | A): K - N is onto N. Since |K| =k and |N| =n 
, there is a function g: & — n that is onto n (see Exercise 14), contradicting 
Lemma 5.1.12, as & € n. Therefore, f is one-to-one. 


Theorem 5.1.14. If A is a finite set, then P(A) is finite. 


Proof. We will use proof by induction. Let 
I= {n€w: for all X, if |X| =n, then P(X) is finite}. 


If |X| =0, then ¥ = @ and P(X) = {©} is finite. Hence, 0 € J. Let 
n€ J]. To prove that »* € J, let X be such that |X | =n. So X is 
nonempty. Let a € X . Thus, |X \ {a}| = mn (see Exercise 13). As n € J, we 


conclude that P(X \ {a}) is finite. By Corollary 5.1.4, there exists a bijection 
f: P(X \ {a}) 4 m where m = |P(X \ {a})|. Define 
h: P(X) + m-+m by 


h(U) f{U), ifagU; 
| m+f(U\{a}), ifaeu. 
(5.3) 


To prove that f, is one-to-one, suppose that h(U) = h(V) with U € P(X) 
and V € P(X). Thus, h(U) = h(V) =i for some j € 2m. There are two 
cases to consider: i€ m and m€i.lIfiem, then a¢U anda ¢ V, by 
(5.3). Because h(U) = h(V), we conclude that f(U) = f(V). SoU =V 
since f is one-to-one. If m € 7, then (5.3) implies that a € U and a € VY. As 
h(U) = h(V), (5.3) asserts that m+ f(U \ {a}) =m-+ f(V \ {a}). So 
f(U \ {a}) = f(V \ {a}). Hence, U \ {a} = V \ {a} because f is one-to- 
one. It therefore follows that [’ = )’. Since h: P(X) — 2m is one-to-one and 
2m €w, we have that P(X) is finite. Thus, »*+ € J and the proof is 
complete. L] 


Exercises 5.1 


1. In our proof of Theorem 5.1.3, under the case that fr € 777 , we did not show that 


the function G is one-to-one. Complete the proof by proving that J is one-to- 
one. 


2. Suppose that the set J is finite and A é BB . Prove that A. is finite. 

3. Let A and J} be two finite sets, and let f : A — B be one-to-one. Prove 
that | A] < |B]. 

mA; Using Exercise 3 and Corollary 5.1.4, prove Theorem 5.1.7. 

*5. Let € w andlet f: nm — A . Prove that f [n| is finite. 

6. Let A be finite and let Ff: A — w. Prove that ran(f) C k, for some 
Kew. 

7. Suppose that F J : A —+ n is a bijection, where 72 € W. Prove that 
|A| =n. 

8. Let Le Ww & WwW —- w be defined by fG,3) — 9% . 3). Prove that f is 


one-to-one. Prove thatif i < 7 and 7 < m,then f(i, 7) < f(m,n). 

*9. Prove that DP: WX W—> W defined by p(i, 7) 2' (2) | 1) = | 
is one-to-one and is onto WJ. Prove that if 7 € ym and E € mn, then 
p(i, 3) € p(m,n). 

10. Prove there exists a one-to-one function Pi >: W—-+> Ww &X Ww that is onto 
WK WwW. 

11. Suppose that A and /9 are disjoint finite sets. Prove that 4 J) /9 is finite. 

*12. Let A be an infinite set and [3 be a finite set. Prove that A \ B is 
infinite. 

“13. Let fp € yw. Suppose that 
|A\ {a}| =k. 

“14. Suppose that | K| k, || n,and f: K -+ N is onto N 
where f; € yy and 7 € W. Show that there is a function gq: ke —> n that 


A| k* and qe A. Prove that 


is onto 7? . 

“15. Suppose that A and J9 are finite sets. Prove that A J f9 is finite. 

16. Let 2 € W. Suppose that f: nm + A isonto A . Prove that A is finite. 

“17. Let A bea finite set. Prove that there exists an ff: 7. A that is onto 
A forsomen € W. 

18. Suppose that Sj - A + B is onto B where A is finite. Prove that 3 is 
finite. 

19. Let 7m € W and {A ,it€ n} be such that 4 ; is finite for all 7 E n. 
Prove, by induction, that U ien A; is finite. [Use Exercise 15 in the inductive 
step. ] 

20. Suppose that A and 9 are finite sets. Prove that 4 ~% /9 is finite. 

* 21. Prove Corollary 5.1.11. 

* 22. Let (A; ‘sel ) be an indexed function where J is a finite set and 
A i f- @ for all ¢ € J. Without using the axiom of choice, prove by 
induction on | I | that there is an indexed function (x ,:2El ) such that 
x; € A; foralie J. 

“23. Using Exercise 7, modify the proof of Theorem 5.1.14 to show that if 


| A| —= mn, then |P(A)| — gn 


Exercise Notes: For Exercise 6, apply Exercise 18 on page 109. For Exercise 7, use 
Theorems 5.1.7 and 5.1.8. For Exercise 9, if ] m, then m— | = U m. (see 


Theorems 4.1.6 and 4.1.10). To prove } is onto W, apply Theorem 4.4.13 to prove that 
foraln €w,if 1 En, then y = 2* (2; T 1) for some 7 € w and 7 Cw. 
Note: if & € uw ,then & — fp? — |. For Exercise 10, use Exercise 9. For Exercise 12, 
apply Exercise 11. For Exercise 16, let [,, = {i En: f (i ) = a} for aE A. 
Since f is onto A, each the set /,, is nonempty and has a least element. For 


Exercise 17, use Corollary 5.1.4. For Exercise 18, apply Exercises 16-17. For Exercise 20, 
apply Exercise 9. For Exercise 21, recall Theorem 3.3.19. For Exercise 22, the result is true 
vacuously if [J | (). For Exercise 23, note that if 77 & 21€m-+m., then 


Exercise 12 on page 108 and Theorem 4.4.11(1) imply that 7 m+k forakem. 


5.2 Countable Sets 


Recall that w = {0,1,2,3,...} is the set of natural numbers. A set is countable 
if it has the same size as some subset of w. In other words, a set is countable if 
there is a one-to-one correspondence between the set and a subset of w. Our 
next definition captures this concept in mathematical terms. 


Definition 5.2.1. A set .¥ is countable if and only if there exists a one-to-one function 


[:X -w. 


So by Definition 5.1.1, every finite set is countable (as n C w, for all n € w 
). Every subset § of w is also countable, since the identity function i: S + w 
is one-to-one, where i(a) = a for all « € S. Thus, in particular, the set of 
natural numbers w is countable. One can also show that the set of integers and 
the set of rational numbers are countable (see Exercises 4 and 5). We will prove 
in Section 5.3 that the set of real numbers is not countable. 


Definition 5.2.2. A set is countably infinite if it is countable and infinite. 


For example, the set w is countably infinite as it is countable and infinite by 
Theorem 5.1.10. When one can prove that a set is countable and it is clear that it 
is infinite, then we can conclude that the set is countably infinite. 


Theorem 5.2.3. Suppose that A and # are sets where # is countable. If there 
is an injection g: A+ B,then A is countable. 


Proof. Let A and 8 besets where #8 is countable. Let g: A - B be one-to- 
one. We shall prove that 4 is countable. Since 8 is countable, there is a one- 
to-one function f: B — w. Thus, by Theorem 3.3.19, (fog): A—w is an 
injection. We conclude that 4 is countable. L] 


Theorem 5.2.4. If 8 is a countable set and A C B, then A is countable. 


Proof. Assume that #8 is a countable set and A C #. Let i: A-+ B be the 
identity function, that is, i(@) =a for all « € A. Since ; is one-to-one, 
Theorem 5.2.3 implies that 4 is countable. L] 


Theorem 5.2.5. If 4 and #8 are countable sets, then 4 \J 9 is countable. 


Proof. Let A and # be countable. So there are one-to-one functions 
f: A—w and g: B > w. Now define the function h: AU B - w by 
2: f(z), if z € A; 
A(z) = F(z) re 
2-g(a)+1, iffe B\A, 
(5.4) 


for each x € AUB. We prove that h: AU B - w is one-to-one. Let x and 
y be in AUB and assume h(a) = h(y). We shall prove that « = y. First, 
because h(x) = h(y), we cannot have x € A and y€ B\ A. To see this, 
suppose that x € 4 and y€ B\A. So, as h(x) = h(y), we infer that 
2- f(a) =2-g(a)+ 1 from (5.4); but this is impossible because a natural 
number cannot be both even and odd. Similarly, we cannot have y € A and 


«x € B\ A. Hence, either x and ¥ are both in A, or they are bothin B\ A. 
If x and y are in A, then 2- f(a) =2- f(y). Thus, f(a) = f(y) and so 
&=Y, because f is one-to-one. If « and Y are in B\ A, then 
2- g(x) +1=2-g(y) +1. We conclude that g(a) = g(y). Since g is one- 
to-one, we have « = y. Therefore, /, is one-to-one and A U BP is countable. LJ 


One can prove by mathematical induction, using Theorem 5.2.5, that a finite 
union of countable sets is also countable. We will not do this here, because we 
will soon prove a more general result (see Corollary 5.2.10). 


Our next result demonstrates that every countably infinite set can be put into a 
one-to-one correspondence with the set of natural numbers. 


Theorem 5.2.6. If A is countably infinite, then there is a bijection g: w+ A. 


Proof. Let 4 be a countably infinite set. Thus, there exists a one-to-one 
function f: A—w. Since A is not finite, the range of f must be an infinite 
subset of w. Let Y =ran(f). So f: AY is a bijection. Hence, 
f~': Y +A is also a bijection, via Theorem 3.3.18. Because Y is an infinite 
set of natural numbers, there is a bijection A): w— Y by Exercise 13. Let 
g = f~'oh. Theorems 3.3.19 and 3.3.20 now imply that g:w—A isa 
bijection. LJ 


Corollary 5.2.7. A set A is countably infinite if and only if there is a bijection 


giw— tA. 


Proof. If A is countably infinite, then Theorem 5.2.6 implies that there exists a 
bijection g: w — A . Conversely, suppose that g: w — A isa bijection. Since 
g is one-to-one, A_ is infinite by Corollary 5.1.11. As 9 is onto A, 
Theorem 3.3.18 asserts the existence of the inverse function g~!: A — w that 
is one-to-one. Hence, A is countable and infinite. LJ 


Theorem 5.2.8. If A is a countably infinite set, then there exists an 
enumeration @o,@1,@2,..., Qn,--- Of all the elements in A such that each 
element in 4 appears in this enumeration exactly once. 


Proof. Assume that 4 is countably infinite. Thus, by Theorem 5.2.6, there is a 
bijection g: w + A. For each n € w, let ay g(n). Since g is one-to-one 
and onto A, it follows that the enumeration @o, @1,@2,---,@n,--- lists every 
element in 4 exactly once. L1 


Theorem 5.2.8 implies that each countably infinite set is also denumerable; 
that is, we can list the elements of a denumerable set in the same way that we list 
the natural numbers, namely, 0,1,2,3,4,5,.... We will soon show that it is 
impossible to list all of the real numbers in such a manner. In other words, Ris 
not denumerable. 


Theorem 5.2.9 (AC). If {A;:i€w} is such that A; is countable for each 
i € w, then U Ai is countable. 


~1cw 


Proof. Suppose that {A; : i € w} isa set of countable sets. Because each A; is 
countable, the axiom of choice implies that for every i € w there is a one-to-one 
function f;: A; — w. In other words, by the axiom of choice, there is a choice 
function (f; : i € w) for the indexed function (F; : i € w) where each F; is 
the set of all the one-to-one functions of the form h: A; > w. 


Now let p: w X w — w bea one-to-one function (see Exercise 9, page 116). 
Define the function 9: Uiew Ai 7 w by 
g(x) = p(i, fi(a)) where 7 is the least i € w such that x € Aj, 


for all x € ),-., Ai. To prove that g is one-to-one, assume that g(a) = g(y) 
where a and y are in \),-,, A;. Let i € w be the least natural number such 
that 2 € A;, and also let j €w be the least such that y € A;. Since 


g(x) = gly), we have p(i, fi(x)) = p(j, fj(x)). Because Pp is one-to-one, it 
follows that i= j and fi(a) = f;(y). Thus, f;(a) = f;(y) andso = y, as 
f; is one-to-one. Therefore, g is one-to-one and ();-.,, A; is countable. 0 


We can now prove that a countable union of countable sets is also countable. 
Therefore, countable sets can be used to construct many more countable sets. 


Corollary 5.2.10 (AC). Let .4 be countable and suppose that each B € A is 
also countable. Then |_J.A is countable. 


Proof. Let ,4 be as in the statement of the corollary. Thus, there is a one-to-one 
function fA}: Asw. If A=@, then LJA @ is countable. So assume 
Ao and let By € A be fixed. Consider the indexed set {A;:i € w} 


As: B, if i€ran(h) and h(B) =i; 
‘ | Bo, if i¢ ran(h), 


defined by 


for each i€@w. Therefore, A; is countable for every i€w, and 
UA U Ai Theorem 5.2.9 now implies that _).A is countable. L] 


Since a finite set is countable, Corollary 5.2.10 implies that a finite union of 
countable sets is also countable. 


Corollary 5.2.11 (AC). If the indexed set {A;:i€/} is such that J is 


countable and A; is countable for each j € J, then U Aj is countable. 
te 


Let A bea set and let 7 be a natural number. Recalling Definition 3.3.6, we 
have that "A= {f: f isa function from n to A}. The set "A _ can be 
viewed as the set of all sequences of elements in 4 of length n. 


Theorem 5.2.12. Let A be a countable set. Then ™A4 is countable, for all 
new. 


Proof. Let A be countable. Thus, there is a one-to-one function “/: 4 —s yw. Let 
I ={n€w: "A is countable} . We shall prove that J is inductive. Since 


°A4 = {@} (see Remark 3.3.7), the set 94 is countable. Thus, 0 € /. Let 
n€J]. Therefore, "4 is countable. So let g: "A—+w be one-to-one. By 
Exercise 9 on page 116, there is a one-to-one function p: w X w — w. Define 


he mtd 5 y by MF) = Plo In), Fr) 


for all f ¢ ™ 4. We will now prove that fh is one-to-one, and thus conclude 
that n*4 is countable. Let f eA and gE” A. Assume that h(f) = h(q). 
Therefore, P(9(F I 7), €(f(m)) = plg(a In), (a(n). 


Because Pp is one-to-one, we conclude that 


WF | nr) = g(q | n) and &(f(n)) = e(q(n). 


Since J and ¢ are one-to-one, we infer that f | n= q|n and f(n) = q(n). 
Thus, by Lemma 3.3.5, f = q.So fh is one-to-one. Therefore, p+ ¢€ 7. U 


_ By applying Theorem 2.1.3, one can show that the following class is a set: 
““A = {f : for some n € w, f is a function from n to A}. 


So for every function f:m-— A, where n €w, we have that fe S%A. 
Clearly, 


of oe eh 


new 


(5.5) 


Theorem 5.2.13 (AC). Let 4 bea countable set. Then ©“ 4 is countable. 


Proof. If A is countable, Theorem 5.2.12 asserts that ™A4 is countable, for 
every m €w. Equation (5.5) and Theorem 5.2.9 thus imply that ©“, is 
countable. LJ 


Corollary 5.2.14 (AC). If 4 is countable, then the set of all finite subsets of 4 
is also countable. 


Proof. Let /'4 be set of all finite subsets of A. Define the function 
G: “A —- F4 by G(f) = ran(f). Exercise 17 on page 116 implies that G 
is onto /’\4. Hence, F’4 is countable by Theorem 5.2.13 and Exercise 10. LJ 


Remark 5.2.15. With a little more work, one can prove Theorem 5.2.13 (and 
Corollary 5.2.14) without appealing to the axiom of choice. We outline two such 
proofs below. The first proof is inspired by the proof of Theorem 5.2.12, and the 
second proof applies the fundamental theorem of arithmetic (see [1, Theorem 
4.7.7)). 


1. Suppose that ¢: 4 —s w and p: w X w — w are one-to-one functions. 
By the Recursion Theorem 4.2.1, one obtains the indexed function 
(hy 2m € w) , where h,,: "A — w for all n € w, such that 

(1) ho(@) = 0 (recall that °A = {2} ); (2) 


hy+(f) = plhn(f | n),e(f(n)) forall fen". 


One can prove that each h,,: A — w is one-to-one by induction. Clearly, hy 
is one-to-one. In the inductive step, one applies the argument used in the proof 
of Theorem 5.2.12. One now “glues the functions /,, together” by defining 


G: A -s w by G(f) = p(n, hn(f)) where n = dom(f). 


t 


One can now prove that G is one-to-one and thus, <“ 4 is countable. 


2. Let ¢: 4 —+ w be one-to-one. One must first prove that ©, is countable. 


To do this, define H: —w > 


>W by 
l, if dom(f) = 0; 
H(f) 


QF (O)+1 , BAA)41 . 5FQ)+1.. yf) if dom(f) = kt, 


where Pi is the j-th prime. By the fundamental theorem of arithmetic, the 
function #7 is one-to-one. Hence, <“,) is countable. Since ¢: A —s wy, define 
G: A> &w by G(f)=of. Therefore, if f:n—-+A, then 
G(f): n — w (see Theorem 3.3.10). As ¢ is one-to-one, it follows that G is 


one-to-one. Therefore, because ©“, is countable, Theorem 5.2.3 implies that 
€w 4 is countable. 


Exercises 5.2 


1. Le A = {4,8,12,16,...} andle B={nE€Z:n< —25}. 
Define a bijection E : A — w,and define a bijection g: Bow. 

2. Let 4 and J} beas in Exercise 1. Define an injection h: AU B+ w. 

3. Show that the set of negative integers 7 is countable. 

*4. Conclude from Exercise 3 that set of integers 7 is countable. 

*5. Let Dp: WX Ww —> W be one-to-one (see Exercise 9 on page 116). Using P, 
show that the set of positive rational numbers Q * is countable. Conclude that 
the set of negative rational numbers Q is countable and the set Q is 
countable. 

6. Let A and #3 be two countably infinite sets. Prove that there exists a bijection 
f:AsB. 

7. Let 4 and J} be countable sets. Prove that 4 x /? is countable. 


8. Let A be a set. Suppose that f: w —- A is onto A . Prove that 4 is 
countable. 

9. Let A be a nonempty countable set. Prove that there exists an f -woA 
that is onto A . 


* 10. Suppose that f; A —+ B is onto where A is countable. Prove that 
PB is also countable. 


“411. (AC) Let A be an infinite set and let C P( A) \ { a} . By 
Theorem 3.3.24, there exists a function g : C — A such that gf B ) eB 
for all (3 € C. One can show (see Theorem 6.2.3) that there is a function 


ee ee h(n) = g({A \ h[n]), for all n € w. 


Prove that /,: yu —$ A is one-to-one. 


12. Let 4 bea finite set. Prove that ™ 4 is finite, forall 2 € W. 
"13, Let ¥ C w be infinite and @ be the least element in Y. Define J yy —_ Y 


by f(n) = the least element in {k EY :ne€ k}. 


By the recursion theorem, there is a function fy: 4 —s Y such that 


GQ) h(0) =a.) h(n*) = f(h(n)), forall n Ew. 


Show that h(n) = h(n r) for each 72 € &. Conclude, from Exercise 14 on 
page 108, that / is an increasing, one-to-one function. Prove that for each fr € yy there 
isan 7 € qu sothat & € h(i) . Prove that ran(h) Y and that epee 1s 


a bijection. 


14, Let Z|] be the set of polynomials in x with integer coefficients, that is, 


Z |x] = {ap +ayxa+---+axz* : k € w and a; € Z for all i € k*}. 


Define a surjection f : ““Z — Z|ax|. Now show that Z|2| is countable. 


15. A real number a is said to be algebraic if p(a) = 0 for some p(x) € Zz 
(see Exercise 14). For p(a) € Za], let Rp) {a€R: p(a) = 0}. Since a 
polynomial in Z|a:] has only finitely many roots, each set /?,(,) is finite. Let A\ 
be the set of all algebraic numbers, that is, A Un (2)¢ Z(x] Ry (x). Prove that 
A is countable. 


Exercise Notes: For Exercise 5, first assume each x € Qt is in reduced form. 
For Exercise 6, first review the three Theorems 3.3.18—3.3.20. For Exercise 8, let 
I, ={n€w: f(n) =a} foreach qa € A. Because f is onto 4, each J, is 
nonempty and has a least element. For Exercise 10, use Exercises 8-9. For 
Exercise 12, modify the proof of Theorem 5.2.12. For Exercise 13, show that for 
every j3¢ Y and jE Y, if ic j, then f(z) € f(j). Now use induction. To 
prove that ran(/) = Y, one can apply Theorem 4.4.13. For Exercise 14, apply 
Theorem 5.2.13 with Exercises 4 and 10. For Exercise 15, use Exercise 14 and 


Corollary 5.2.11. 


5.3 Uncountable Sets 


In Section 5.2, we established the existence of many countable sets. One might 
begin to believe that all sets are countable. Are there sets that are not countable? 
We will soon prove that such sets exist. First, we identify a slightly easier way to 
say that a set is “not countable.” 


Definition 5.3.1. A set .Y is uncountable if it is not countable, that is, if there is no one- 


to-one function £ 7X > WwW. 


Cantor was the first mathematician to prove that an uncountable set exists. In 
his proof, Cantor introduced a new and powerful proof technique that is often 
identified as a Cantor’s diagonal argument. This argument has had a profound 
influence on mathematics ever since its introduction. In particular, diagonal 
arguments now frequently appear in mathematical logic and in computability 
theory. The proof of our next theorem illustrates Cantor’s diagonal argument. 


Theorem 5.3.2. Let F =“{0,1} be the set of all the functions 
f:w-— {0,1} .Then F is uncountable. 


Proof. Let # be as stated in the theorem. We will prove that # is uncountable. 
Suppose, for a contradiction, that the set F is countable. Because ¥ is infinite 
(see Exercise 8), Theorem 5.2.8 implies that there is an enumeration 


(5.6) 


of all the functions in F ; that is, every function in F appears in the list (5.6). 
Define the function g: w — {0,1} by 


1, if f,(i) =0; 
0, if fi(i) =1, 


g(i) 
(5.7) 


for each ij Cw. As g: w + {0,1} , we see that g € F. Since every function 
in ¥ is in the list (5.6), we conclude that the function g appears in this list. 
Hence, there is an n € w such that g = f,,. Therefore, g(i) = f, (7) for all 
i € w. So, in particular, (A) g(n) = f,(m). Since f, € F, either f,,(n) = 0 
or f,(n) =1.If£ f,(m) = 0, then g(n) = 1 by (5.7). Thus, (A) implies that 
1 = 0, a contradiction. If f,(m) = 1, then g(n) = 0 by (5.7). Again, (A) 
implies that 0 — 1, which is a contradiction. Therefore, F is uncountable. L] 


Where is the diagonal? 


Are you wondering why the technique used in the proof of Theorem 5.3.2 is 
referred to as a diagonal argument? To answer this inquiry, we shall now revisit 
this proof. Given a function f: w — {0,1}, there is a way of writing the values 
of f as an infinite sequence of terms from the set {(, 1} . For example, suppose 
fii) =0 if ¢@ is even and f(ij)=1 if j{ is odd. So 
f(0) =0, f(1) =1, f(2)=0, f(3) =1, f(4)=0, ..., 


and we can represent f as follows: f = (0,1,0,1,0,1,0,1,...). 
Furthermore, if h: w — {0,1} is represented by 
= (1, 1,0, 1,.0,.1, doce0)'s 


then ACO) = 1 41) = 1,A(2) = 0,43) = 1, 44) = 0, A(5) = Lees. 
Consider the list (5.6) of all the functions in F#. Let us represent each f; in this 
list as an infinite sequence; that is, let 


fi = (fi(0), fi), fi(2), Fi(3), Fi(4), fi(5), ---)- 


Using this notation, we can rewrite the list of functions (5.6) in the following 
vertical form: 


fo = (fo(0), fo(1), fo(2), fo(3), fo(4), fo(5), ---) 
fi =(fi(0), AAG), fa(2), Ar(3), frl4), AS), ---) 
fo = (fo(0), foQ1), fo(2), fo(3), fo(4), fo(5), ---) (5.8) 
fs = (f3(0), fs), fa(2), fa(3), fs(4), fs(5), ---) 
fa =(fa(0), fa), fa(2), fa(3), fa(4), fal), ---) 
fs = (fs(0), fsQ), fs(2), fs(3), fo(4), f5(5), ---) 


In the proof of Theorem 5.3.2, a function g: w — {0,1} was defined so that g 
is not equal to any function in the list (5.8). This is done by going down this list 
and assigning a value to g(i) that is different from the diagonal value f;(i) for 
each f; appearing in (5.8). To illustrate this idea, let us give some specific 
values to the entries that can appear in the diagonal of (5.8). Suppose fo(0) = 0 


, fil) =1, fo(2)=0, fo(3)=0, fa(4) =1, and fe(5) = 1. Thus, 6.8) 


o=( 0 , fo(1), fo(2), fo(3), fo(4), fo(5), ---) 
fi=(fi(0), 1 _ ), f1(3), fi(4), fa(5), .--) 
fo = (f2(0), fe(1), » fa(3), fo(4), fa(5), .--) (5.9) 
fz =(f3(0), fa(1), fs(2), O , f3(4), fs(5), .--) 
fa =(fa(0), fa(1), fa(2), fa(3), 1 , fa(5), ---) 
fs = (fs(0), fs(1), fs(2), fo(3), fs(4), 1 , .--) 


= 2, © oe Le De B@ ¢ DW nase 


We have put the function g below the infinite list (5.9) where the values of g 
are determined by applying definition (5.7), in the proof of Theorem 5.3.2. For 
example, to evaluate g(0) we see that f9(0) = 0, and so g(0) = 1 by (5.7). 


Thus, g(0) # fo(0), and we are thereby assured that g 4 fj. Now we evaluate 
g(1). Since f;(1) = 1, we obtain the value g(1) = 0. Hence, g(1) 4 fi (1) 
and g # f;. Again, because f.(2) 0, we conclude that g(2) = 1 and 
g # f2. Continuing in this manner we construct a function g: w — {0,1} that 
is different from every function in the list (5.9). This is the clever diagonal 
argument that Cantor introduced to mathematics. 


Theorem 5.3.3. Let A and B be sets. If A is uncountable and g: A - B is 
a one-to-one function, then 8 is uncountable. 


Proof. Let A be an uncountable set, and let g: A-—+ B be a one-to-one 
function. Suppose, for a contradiction, that 8 is countable. Since g: A - PB is 
one-to-one, Theorem 5.2.3 thus implies that 4 is a countable set, contradicting 
the fact that A is uncountable. Therefore, 8 is uncountable. LJ 


Before proving our next lemma, we make a simple observation. Whenever 
f:w— {0,1}, we have that f(m) € {0,1} for all nm € w. We can thus use 
f to define a real number by means of an infinite decimal expansion. Let 
fn = f(m) for each nm €w. Then we have the real number given by 
0.fofifofs---fn-++. For example, suppose that f(0)=1, f(1)=0, 
f(2)=1, f(3) =1, «+>, then 0. fofifefs::-fn---=0.1011---f,---- 


Decimal expansions that possess only the digits 0 and/or 1 are, in fact, unique; 
that is, if a real number has a decimal expansion consisting of 0’s and/or 1’s, 
then it has only one such expansion. 


Lemma 5.3.4. There is a one-to-one function G: “{0,1} +R. 


Proof. Let F = “{0,1}. For each f € F let us define (a) f,, = f(n), for 
alné€w.So f, € {0,1} for every n € w. Define the function G: F + R 
by 


G(f) = 0.fofife-- 
(5.10) 


for each f € F.Let f and f be functions in 7. Assume G(f) = G(h). We 
shall prove that f = h . Because G(f) = G(h), we conclude from (5.10) that 
ie ee oe 


Since such decimal expansions are unique, we have that f,, = h,, forall n € w 
. Hence, f(n) = h(n) for all n € w, by (a). Thus, f = / and G is one-to- 
one. L] 


We can now present and prove Cantor’s classic theorem. 


Theorem 5.3.5 (Cantor). The set of real numbers JR is uncountable. 


Proof. By Lemma 5.3.4, let G: “{0, 1} — R be one-to-one. By Theorem 5.3.2, 
“{0, 1} is uncountable. So Theorem 5.3.3 implies that JR is uncountable. 1] 


Exercises 5.3 


1. Let A be uncountable. Prove that 4 » 9 is uncountable for any nonempty set 
B. 

. Let A be uncountable and /} be countable. Prove that A \ B is uncountable. 

. Prove that the set of irrational numbers JR \ Q is uncountable. 

. Suppose that 4 is uncountable and 4 C Jf . Prove that #3 is uncountable. 

. Prove that “),) is uncountable. 7 

. Suppose that 4 is uncountable. Prove that ‘P tj A) is uncountable. 

. Let A # @ . Prove that if J is uncountable, then A FB is uncountable. 

*8. Let F be as in Theorem 5.3.2, and let(x ) fy, fo fn bea finite list 
of functions in F . Using the argument in the proof of Theorem 5.3.2, define a 
new function g € F that is not in the list (x ). Conclude that ¥ is infinite. 


eno»u»sb w nw 


9. Suppose that someone claims that the set of real numbers in the interval ( eT ) is 


countable and that all of these real numbers can be enumerated as in (5.11), where 


each such real number is represented by an infinite decimal expansion: 


a1 = .12345689234 --. 
a = .68729958219 - - - 
a3 = .05050506620 - - 
a4 = .57591884622 - -- (5.11) 


vy ol G1 TiQViglji4XVi5*** 


You must show that this claim is false. Using Cantor’s diagonal argument, define 
a decimal expansion for a real number fy in (0, | ) that does not appear in the 


list (5.11). In order to ensure’ uniqueness, your expansion 
b by bo be caer b; . .. should not contain the digits 0 or 9. Identify the first 


four digits in the decimal expansion of /) . Prove that /) =f Ze, forall RE Ww. 


10. Let S {q a Q et a q < 1} . Theorem 5.2.4 and Exercise 5 on 


page 123 imply that is countable. We can thus enumerate all of the elements 


in § ina list (*) 91,92, 93,..., by Theorem 5.2.8. Since each of these 
rational numbers has an infinite decimal expansion 
Gi = -9i19i29i39i44i5 *** , one can define a real number b € (0, 1) 


that is not in the list (% ) just as in Exercise 9. Is fy a rational number? Justify 
your answer. 

11. A real number that is not algebraic is said to be a transcendental number (see 
page 124, Exercise 15). Let JJ be the set of all transcendental numbers. Show 
that TT is uncountable. 


Exercise Notes: For Exercise 3, apply Exercise 5 on page 123 and Exercise 2. For 
Exercise 9, if the decimal expansion of fy does not contain a 0 or 9, then fy will have a 


unique decimal representation. Thus, if the decimal expansions of fy and 2& in (0, 1) 


have different digits in at least one decimal place, then /) f a. For Exercise 11, use 


Exercise 15 on page 124 and Exercise 2. 


5.4 Cardinality 


In set theory, the cardinality of a set is a measure of how many elements are in 
the set. The set A = {1,2,3,..., 11} has 11 elements, and so the cardinality of 
A is 11. Thus, |A| = 11 (see Definition 5.1.2). The cardinality of an infinite 
set X will also be denoted by |X| . In this section, we shall present Cantor’s 
method for measuring the size of an infinite set without the use of numbers. 
There are infinite sets, as we will see, where one of these sets has cardinality 
much larger than that of some other infinite set. Therefore, it is possible for one 
infinite set to have “many more” elements than another infinite set. 


What does it mean to say that two sets have the same cardinality, that is, the 
same size? Cantor discovered a mathematically precise and simple answer to this 
question. 


Definition 5.4.1. For sets 4 and 3 , we say that 4 has the same cardinality as /3 , 


denoted by | A e | B | , if there is a bijection 7 > Ait 


Remark 5.4.2. The expression |A| =, |B| looks like an equation; however, the 
assertion |A|=,|B| should be viewed only as an abbreviation for the 
statement “A has the same cardinality as #.” In other words, |A| =, | B| 
means that “there is a function f: A— P that is one-to-one and onto PB.” 
When 4 and # are finite sets, Theorem 5.1.7 implies that |A| = |B| if and 
only if |A] =, |B . 


The relationship =c, given in Definition 5.4.1, is reflexive, symmetric, and 
transitive (see Exercises 4—5). 


Theorem 5.4.3. Let A bea set. Then é 0, 1}| ¢ |P(A)| . 


Proof. For each f € “{0,1} , define X, C A by Xp = {w EA: f(x) = 1} 
Define the function G: “{0,1}-+ P(A) by G(f)=Xy for each 
f € “{0,1} . Then G is one-to-one and onto P(A) (see Exercise 10). 0 


Corollary 5.4.4. |“{0,1}| =, |P(w)| . 


Theorem 5.4.5. If | A] =, |B| , then |P(A) 


= |P(B)| : 


Proof. Suppose that |A| =, |B|. Thus, there is a bijection f: A- B. 
Consider the function G: P(A) + P(B) defined by G(X) = f|X] for every 
X € P(A). Exercise 15 on page 68 shows that G is one-to-one. To show that 
G is onto P(B), assume that Y € P(A). Let X = me ae Since f is onto 
Y, Exercise 13 on page 68 shows that G(X) = Y. Therefore, G is a bijection. 
XO 


Theorem 5.4.5 shows that the power set operation “preserves” cardinality. Our 
next theorem identifies three other such operations. 


Theorem 5.4.6. Suppose that |A] =. |A| and |B| =, |L]. 


(1) If ANB=@ and KNL=@, then |AUB|=.|KUL|. 
(2) |AseBl=s2 |K.x«. 4) 
(3) [AB] =,.)92). 


Proof. Assume that |A]=,|K| and |B|=,|L|. Let f: A— Kk and 
g: B — L be bijections. 


(1) Assume ANB=@ and KNL=@.LetH: AUBS KUL 
be defined by 


H (x) f(z), ifxeA; 
_ g(x), ifx€ B. 


Since AN B = @, it follows that HV is a function; that is, FY is 
single-valued. As f and 9 are bijections, 7] is a bijection because 


KNL=s. 


(2) Define hh: Ax B+K-xJL by .Then isabijectionfrom to 
(see Exercise 19). 
(3) We have that and_ are bijections. Let .Thus,_ . Observe that 


.Define by 


for each . To prove that is one-to-one, let and  . Assume that 
Thus, forall . Hence, 


(see Remark 3.3.11 on page 61). Since __is one-to-one, we conclude that 
(5.12) 


We now show that forall .Let .Thus, .Letting  in(5.12), we obtain 

since by Theorem 3.3.18. So, and hence, is one-to-one. To prove 
that is onto , let . Then and (see Exercise 20). Therefore, the 
function is a bijection. UJ 


In a similar manner, one can prove the next theorem (see Exercises 23—25). 


Theorem 5.4.7. For allsets , , ,and_ , we have the following: 


(1) If ,then 
(2) 
(3) 


The proofs of our next two corollaries apply Theorems 5.4.6 and 5.4.7. 


Corollary 5.4.8. 


Proof. Let .Itiseasytoshowthat( )  . Thus, 


Therefore, (see Exercise 5). LJ 


Corollary 5.4.9. 


Proof. Theorems 5.2.5 and 5.2.6 imply that( ) .Notethat' .We prove that 
as follows: 


Therefore, . U1 


Using our cardinality notation, we will now restate some of the results that 
were previously established about two countably infinite sets. 


Theorem 5.4.10 (Cantor). = and 


Proof. Exercises 4 and 5 on page 123 imply that and are countably 
infinite. By Theorem 5.2.6, there are bijections and .Therefore, and 
L] 


What does it mean to say that one set has smaller cardinality than another set? 
Cantor found a simple answer to this question as well. 


| 


Definition 5.4.11. A set has cardinality strictly less than that of aset| |, denoted by 
, if there is a one-to-one function and there is no function that is both one-to-one 


and onto 


Theorem 5.4.12 (Cantor). 


Proof. Let bedefined by — . This function is one-to-one. We now show that 
there is no function that is one-to-one and onto  . Suppose, for a 
contradiction, there is such a bijection . Hence, is one-to-one by 
Theorem 3.3.18, and therefore, is countable. This contradicts Theorem 5.3.5. 
Thus, .U 


After proving Theorem 5.4.12, Cantor asked the following question: Is there a 
set of real numbers whose cardinality is strictly between the cardinality of 
and the cardinality of +? In 1878, Georg Cantor announced a conjecture that is 
now called the continuum hypothesis. 


Continuum Hypothesis. There is no set — such that 


The continuum hypothesis proclaims that every set of real numbers is either 
countable or has the same cardinality as the set . Cantor struggled to resolve 
this conjecture, without success, for much of his career. The problem persisted 
and became one of the most important unsolved mathematical problems of the 
twentieth century. It was only after Cantor’s death that it was shown to be an 
unsolvable problem. The contributions of Kurt Gédel in 1940 and Paul Cohen in 
1963 showed that the continuum hypothesis cannot be proven or refuted using 
the axioms in 


Recall that for every set. , the powerset is the set of all subsets of _. For 
example, if , then and observe that (see Definition 5.1.2). In fact, one 
can prove that if isa finite set with many elements, then has many 
elements (see Exercise 23, page 116). Thus, whenever is a finite set; 
however, what happens when the set __ is infinite? Georg Cantor answered this 
intriguing question as well, using his diagonal argument. 


Theorem 5.4.13 (Cantor). Let —_ be any set. Then 


Proof. Let beaset.Let bedefinedby  fforall . It is easy to show that 

is one-to-one. We must show that there is no one-to-one function that is 
also onto . Suppose, for a contradiction, that there exists a bijection 
Observe that forall .Let  . Clearly, and thus, .As is onto, 
thereisan  suchthat  . There are two cases to consider, namely, either — or 

. If, then the definition of implies that . Since ,wehavethat , 
which is a contradiction. If , then the definition of impliesthat . Because 

, we conclude that , which is again a contradiction. Thus, there is no 
bijection .Therefore, .L 


Definition 5.4.14. Let and be sets. Then has cardinality less than or equal to 


, denoted by __, if there is an injection 


Theorem 5.4.15. If , then 


Proof. Assume _. So there is a one-to-one function .Define by forall 
. Exercise 15 on page 68 shows that _is one-to-one. Therefore, . LJ 


Theorem 5.4.15 shows that the power set operation “preserves inequality,” 


and its proof involves only a slight modification of the proof of Theorem 5.4.5. 
The following theorem identifies other set operations that also “preserve” the 
notion of inequality given by Definition 5.4.14. The proof can be obtained by 
appropriately modifying the proof of Theorem 5.4.6 (see Exercises 28-30). 


Theorem 5.4.16. Suppose that and 


(1) If and_ ,then 
(2) 


(3) if is nonempty. 


We will soon present the Schréder—Bernstein Theorem, which is very useful 
for proving many results about cardinality. The theorem states that if there are 
functions and __ that are both one-to-one, then there exists a function that 
is one-to-one and onto . The theorem sounds very reasonable; however, a 
proof that does not depend on the axiom of choice was seen as quite challenging 
and even eluded the brilliant Georg Cantor. 


The following lemma, illustrated in Figure 5.2, will be used in our proof of the 
Schréder—Bernstein Theorem. 


Figure 5.2. Illustration of Lemma 5.4.17 where __ satisfies 


Lemma 5.4.17. Let and _ be functions. Then there exists an __ that satisfies 
the identity 


(5.13) 


Proof. Let and  .Definethe function by 


Therefore, by the definition of ,foreach owehavethat and( )_ .So,if 
aset satisfies ,then( ) wouldimply thatthe set satisfies , and thus, 
(5.13) holds. 


Claim1.Let and .If _ ,then 


Proof. See Exercise 32.(Claim1) UO 


Consider the set. .Since ,weseethat ,andso __ is nonempty. Let 
Clearly, and 


Claim 2. 


Proof. Let .Because ,weseethat .So, byClaim1. Since _, we infer 

that .Thus, forevery .Hence, ;thatis, .Wenowprovethat . As 
, Claim 1 implies that .So  .Thus, ;thatis, .Therefore, . (Claim 2) 
LJ 


Since ,weconclude that satisfies (5.13). (Lemma) L] 


Figure 5.2 depicts the set in Lemma 5.4.17 and the identity (5.13). This 
figure may also help the reader to better understand the proof of the following 
theorem, which is named for Ernst Schréder and Felix Bernstein. The proof 
depends only on (at most) the first seven Zermelo—Fraenkel axioms. 


Theorem 5.4.18 (Schréder—Bernstein). Let. and be any two sets. Suppose 
that and .Then 


Proof. Assume that and_. So, there are one-to-one functions and_. By 


Lemma 5.4.17, there isan (see Figure 5.2) satisfying 


(5.14) 


Since is one-to-one, Theorems 3.2.7 and 3.3.15 imply that —_is a one-to-one 
function such that . As , Equation (5.14) implies that . Define the 
function by 


(5.15) 
We will soon prove that —_is one-to-one and onto 
Claim 1. 
Proof. Clearly, .Since is one-to-one, Exercise 5 on page 68 implies that 


. Thus, by (5.14), we have that’ .(Claim1) UO 


Claim 2. The function — is one-to-one. 


Proof. Let and .Assumethat  . The definition (5.15) of — yields three 
key cases:(1) and  arebothin ,(2) and_~ arebothin , or (3) 
and . The argument in case (3) will cover the fourth case: = and 


(1)If and ,then  by(5.15). Therefore, because _ is one-to-one. 


(2)If and ,then  by(5.15). Since the function is one-to-one, we have 


(3)If and ,thenweobtain .Thus, .Claim1impliesthat  . Therefore, 
case (3) is impossible. (Claim 2) U1 


Claim 3. The function is onto 


Proof. Let . Hence, either (1) or (2) . We consider these two cases as 
follows: (1) If , then for some  . From the definition (5.15) of , it 
follows that 


(2) If ,then(5.14)impliesthat .Thus,( ) forsome_. By applying the 


function tobothsidesof( ),weobtain’ .Therefore, as_ . (Claim 3) 
L] 


Since isa bijection, we see that .(Theorem) L1 


Given sets and , sometimes one can easily define one-to-one functions 
and; yet it may be very difficult to specify a one-to-one function from 
onto . In this situation, one can apply the Schréder—Bernstein Theorem to 


conclude that such a function does exist and thereby conclude that and 
have the same cardinality. 


Lemma 5.4.19. Let be the set of rational numbers and let _ be the set of real 
numbers. Then 


Proof. For ,let .Let  bedefinedby foreach .Toprovethat is 
one-to-one, let and .Assumethat .Weshall prove that . Assume, fora 
contradiction, that . Without loss of generality, we can assume that. Since 
the rational numbers are dense in’ ,thereisa suchthat .Thus, and 
Therefore, , whichis acontradiction.So and _ is one-to-one. L] 


Theorem 5.4.20. 


Proof. By Theorem 5.4.10, we have that . Hence, by Theorem 5.4.5, we see 
that . Lemma 5.4.19 allows us to now conclude that . Moreover, 
Corollary 5.4.4 states that , and Lemma 5.3.4 implies that . Thus, 
Therefore, by the Schréder—Bernstein Theorem 5.4.18, we have that .U 


Our next corollary shows that the real line has the same cardinality as the 
plane 


Corollary 5.4.21. 


Proof. By Corollary 5.4.9, . Thus, by Theorem 5.4.20 and 
Theorem 5.4.6(2). LJ 


Theorem 5.4.20 allows us to reformulate the continuum hypothesis: 


For every infinite set. , Theorem 5.4.13 asserts that . So CH can be viewed is 
a special case of the Generalized Continuum Hypothesis (GCH): 


The combined work of Kurt Gédel and Paul Cohen also shows that the GCH 
cannot be proven or refuted using the axioms in 


Exercises 5.4 


1. Let and let) . Prove that 
2. Prove that 
3. Let and be countably infinite sets. Prove that) . Conclude that 


4. Let and be sets. Prove that) ,andprovethatif then 


5. Prove that if and, then 
6. Prove that if and, then 
7. Let be countable. Prove that if , then is countable. 


8. Let be uncountable. Prove that if, , then is uncountable. 
9. Prove that if and, then 


10. Prove that the function) _in the proof of Theorem 5.4.3 is a bijection. 


11. Prove that the set is uncountable. 
12. Using Theorem 5.4.18, prove that , where 
13. Let), and let be the function given by 


The function is one-to-one. The proof of Theorem 5.4.13 shows that is not 
onto because the subset of defined by is not in the range of 
Evaluate the set 

14, Let be onto) ,andlet) . Show that 

15. Suppose and. Prove that 

16. Suppose and. Prove that 

17. Prove that 
18. Suppose that , ,and' . Using Theorem 5.4.18, prove that), , and 


19. Let and be bijections. Let be defined by for all, . Prove that 


is a bijection. 


20. Prove that where the functions) , ,  ,and are as in the proof of 


Theorem 5.4.6(3). 


21. Recall that is the set of all functions from to (see Definition 3.3.6). 
Suppose is nonempty. Let and be distinct. Using a diagonal argument, 


show that there is no function that is onto . Now, find a one-to-one 


function) . Conclude that 

22. Provethat| . Conclude that 
23. Prove Theorem 5.4.7(1). 
24. Prove Theorem 5.4.7(2). 
25. Prove Theorem 5.4.7(3). 


26. Using Theorem 5.4.7 and Exercise 3, prove that Using Exercise 22, show that 


and 


27. Using Theorem 5.4.7 and Corollary 5.4.8, prove that 


Conclude, using Exercise 22, that 


28. Prove Theorem 5.4.16(1). 
29. Prove Theorem 5.4.16(2). 
30. Prove Theorem 5.4.16(3). Hint: Let and let be one-to-one. Define 


by 


Show that is a “left inverse for ”; that is, show that for all, . Now replace the 
use of in the proof of Theorem 5.4.6(3) with the function and modify the proof 


appropriately. 


31. Let and be nonempty sets. Using the axiom of choice, prove that there 
exists a one-to-one _ if and only if there existsa __ that is onto 
32. Let and befunctions,andlet and .Provethatif , then 
33. Let , ,and ~ beas in Lemma 5.4.17. Show that if ,then is onto 
andif ,then  isonto 
34. Let .Using Theorem 5.4.18, show that . 
35. Let and .The functions defined by and_ defined by are 
both one-to-one. Let be. Show that . Now, using (5.15) as a guide, 
explicitly define a bijection 
36. Let  and__ be functions. Consider the set’ whose elements are defined 
by the following recursion: 


(a) __, (b) 


Let .Provethatif is one-to-one, then 


37. Let and .Let ~ beas defined in Exercise 36. Assume __ is one-to-one 
and satisfies. . Prove 

38. Let , ,and __ beas in the proof of Lemma 5.4.17. Now assume that 

is one-to-one and let _ be as in Exercise 36. Prove that 

39. Let beasetandlet .Suppose forall and  in_ , we have that 
whenever. Let 


and let .Provethat . Moreover, suppose that satisfies . Prove that 


Exercise Notes: For Exercise 12, use the inverse tangent function . For 
Exercise 21, review the proofs of Theorems 5.3.2 and 5.4.13. For Exercise 22, 
use Theorem 5.4.18. To show that , given ,let' betheset where is 
the function in Exercise 9 on page 116. For Exercise 23, if ,then isin 
For Exercise 24, let and .If ,then let bedefined by and let 
satisfy . For Exercise 25, if ,thenlet bedefined by . For Exercise 31, 


in the implication ( ), let bea left inverse for as in Exercise 30. The 
implication ( ) uses the axiom of choice. For Exercise 32, see Exercise 3 on 
page 36 and Exercise 2 on page 68. For Exercise 34, merge the two infinite 
decimal expansions, each one not ending with repeating 9’s, into one expansion. 
Exercise 36 outlines a different proof of Lemma 5.4.17 when __ is one-to-one. 
Use the “double subset” strategy. For Exercise 37, since _is one-to-one, Claim 
1 (in the proof of Theorem 5.4.18) implies that . For Exercise 38, from the 
proof of Lemma 5.4.17, it follows from Exercise 37 that. . One can show that 
and forall . For Exercise 39, review the proof of Lemma 5.4.17. 


REMARK: Exercises 36, 37, and 39 imply that the set given in the proof of 
Lemma 5.4.17 equals the set. in Exercise 36 when __ is one-to-one. 


6 
Transfinite Recursion 


In Definition 3.4.2, we introduced the notion of a partial order on a set. The 
concept of a total order was then presented in Definition 3.4.3. We also proved 
that there exists a total ordering on the natural numbers so that every nonempty 
set of natural numbers has a least element with respect to this ordering (see 
Theorem 4.4.13). As a result, we were able to prove the important recursion 
theorem. In this chapter, we will generalize these ideas. 


6.1 Well-Ordering 


Definition 6.1.1. (Well-Ordering) A well-ordering is a total ordering on a set A for 


which every nonempty subset of A has a smallest (least) element. 


Suppose that = is a well-ordering on a set A. Whenever SC A is 
nonempty, then the set § has a least element ¢ ; that is, ¢¢€ § and ¢ < x for 
algesS. 


Let < be the usual order on set Z of integers. Then < isa total orderon Z . 
Since S = {n € Z:n <0} has no least element, < is not a well-ordering on 


Z, . 


Theorem 6.1.2. Let < be a well-ordering on A . Then there exists no function 
f:w—A Satisfying f(n*) ~ f(n) foralln Ew. 


Proof. Suppose there is a function such that f(m*) ~ f(m) for all new, 
where = is a well-ordering on A. Let S= {f(n):n€w}. Clearly, $ isa 
nonempty subset of A . Thus, § has a least element f(k).So f(k) < f (n) for 
all m€w; however, by assumption, we have that f(k*) ~ f(k), a 
contradiction. LJ 


Let A be a set with well-ordering < and let S C A be nonempty. When 
les is the smallest element’ in S, we shall = write 
¢ = the <-least element in S. 


Lemma 3.4.12 implies that ¢ is unique. 


Theorem 6.1.3. If 4 is an infinite set with the well-ordering =<, then there is a 
function h: w— A such that h(n) < h(n*) forall n € w. 


Proof. For each x € A, let G, = {y € A: x ~ y} be the set of elements in 
A that are “greater” than x, and let L.. ={y€ A:y<x} be the set of 
elements in A that are “less than or equal” to x. As # is a total order, 
A=L,,UG, for each x € A. Moreover, since A is infinite, it follows (see 
Exercise 15, page 116) that for any 7 € A, 


if L.. is finite, then G‘, is infinite. 


(6.1) 
Let P be the <-least element in 4 , and define f: A — A by 
f(a) the <-least element in G,, if G, # 9; 
wx) = 
D, if Gp = @. 
(6.2) 


So (a) if xe A and G, #@, then x ~ f(a). By the recursion theorem, 
there is a function /,: ,)» —s A such that 


(1) h(0)=p,() h(n*) = f(h(n)), foralln ew. 


We now prove _ that Linn) is finite for all nE€w. Let 
[={néw: Lyn) is finite} . Because h(O) = p is the smallest element in 
A, we see that Lo) = {p}, which is finite. So 0 € J. Let n € J. Thus, 
Lin) is finite, and Grin) #O by (6.1). As h(n*) = f(h(n)) and 
Gin) # @, we infer from (6.2) that h(n*) is the <-least element in Gp(n). It 
follows that Liin+) = Lngny U {h(m™ )} , which is a finite set. Hence, n+ € J 
. Therefore, J —w and Ly;,) is finite for all n €w. Thus, by (6.1), 
Grin) FO for all new. Let new. Since f(h(n))=h(n*) and 
Grin) # O, the above (A) implies that h(n) ~ h(n*).O 


Theorem 6.1.4. Let <’ be a well-ordering on the set #8. Suppose that 
h: A — B is one-to-one. Define the relation < on 4 by x = y if and only if 
h(a) =’ h(y), for all a and y in 4 . Then = is a well-ordering on 4. 


Proof. See Exercise 6. LJ 


Let =< be a well-ordering on a set }{’. For each y% € WW’, we define the 


following initial segments up to uc: 
s*(u)={zEWw:2 xu}, (6.3) 
sS(u)={r EW :2 <u}. (6.4) 


The set s~(w) consists of the elements in }{/ that are less than (smaller than) 
u, whereas s*(w) is the set of elements that are less than or equal to wv. In 
other words, wu € s*(uw) and wu ¢ s~(w). Furthermore, for w and v in W, if 


u <p, then #*(t) C 8%(u) C 8%(v) C 5*(v). 


If the well-ordering ~< is understood, we shall write s(w) for s“(w) as well 
as s(u) in place of s*(w). For example, because w is well-ordered by € and it 


is transitive, if new, then 
s(n) ={i€w:ien}b=n, (6.5) 


s(n) ={iE€w:iEen}=n'. (6.6) 


Theorem 4.4.14, the strong induction principle, applies to the finite elements 
in w. The term transfinite applies to elements that may not be finite. 


Theorem 6.1.5 (Transfinite Induction Principle). Let < be a well-ordering on 
W .Let A bea subset of |, and suppose that 


for allu € W, if s(u) C A, thenu€ A. 
(6.7) 


Then A = W. 


Proof. Suppose that A C W satisfies (6.7) and assume, for a contradiction, that 
there exists a u € W \ A. Thus, the set WW’ \ A is nonempty. Since < is a 
well-ordering, W \ A has a least element wu. So wu ¢ A, and if v < u, then 
v€A. Hence, s(u) C A. As A satisfies (6.7), we have that we A, a 
contradiction. LJ 


Exercises 6.1 


1. Let JR be the set of real numbers and let < be the standard order on JR . 
Suppose f: w — [IR is such that for all 2 and mm inwWw,ifn Em, 


then f(n) < f(m).Le A= {f(i):iew}. 


(a) Let 7 and 7 bein W. Prove that if f (i) < f(j). then LE}. 
(b) Prove that < is a well-ordering on A . 


2. Let 4 w U { w} , and for all @ and Y in A , define the relation € on 
A by © © y ifand only if © € y or L = Y. Show that € is a well- 
ordering on 4 . 


3. Suppose that =< is a well-ordering on A and that f : A no A is such that if 
x -~< ¥, then A i (a) <f (y) whenever and Y arein 4. 


(a) Show that f is one-to-one. 
(b) Prove that 2 =< f (x ) foralg € A. 


4. Let A be an infinite set with the well-ordering =. Now let fy): yy) —s A be 


the function defined in the proof of Theorem 6.1.3. Let y € A . Prove that for 
alneéw,ify~ h(n), then y h(i) forsome 7 € 7). 

“5. Suppose that =< is a well-ordering on Jf/ and let C’ C W . Show that <o 
is a well-ordering on (" . 

*6. Prove Theorem 6.1.4. 


7. Prove that every finite set has a well-ordering. 
8. Prove that every countably infinite set has a well-ordering. 


Exercise Notes: For Exercise 5, apply the result of Exercise 14 on page 76 and review 
Definition 3.4.15. For Exercise 6, Exercise 12 on page 76 implies that =< is a total order on 


A . For Exercises 7-8, use Theorem 6.1.4. 


6.2 Transfinite Recursion Theorem 


A powerful tool in set theory is definition by transfinite recursion. If one wants 
to define a function on a set using transfinite recursion, then one must first have 
a well-ordering on the set. Many of the important functions that we shall 
introduce later in the book are defined by means of transfinite recursion. This 
recursion principle plays a crucial role in set theory, and its importance cannot 
be overstated. 


The concept of transfinite recursion is grounded in recursion on w. Let us 
recall the “finite” Recursion Theorem 4.2.1: Let f: A — A be a function and 
let a € A. Then there exists a unique function },: yw —s A such that 


(1) h(O) =a and (2) h(n*) = f(h(n)), for all n € w. 


The value h(n*) depends on the “previous” value h(n) and on the function f . 
Our proof of the finite recursion theorem used the fact that f was a set. 


We would like to extend Theorem 4.2.1 and prove that there exists a function 
h: w-—> A so that h(n*) depends on all of the prior values h(0), A(1), ..., 
h(n). We would prefer, however, to prove that there is a function ) where 
each value h(n) depends on the restricted function h | n*, that is, on 
h | {0,1,2,...,n} . Can we prove such a result? 


_ As noted on page 121, Theorem 2.1.3 implies that the following is a set: 
<“A = {p: for some n € w, p is a function from n to A}. 


Thus, for every function p: n + A, where n € w, we have that pe ““A. 
Now let f: ““A — A bea function and let a@ € A . To positively address the 


above question, we would need to prove that there exists a function h: w—-+ A 
that satisfies the following two conditions: 


(i) h(O) = a and (ii) h(n*) = f(h |} n°), for alln € w. 


One can combine conditions (i) and (ii) into one condition. Since the empty 
function @ is in ©“4, one can let the function f:““A —- A. satisfy 
f(@) = a and then require that ) meet the following single condition: 


h(n) = f(h | n), for all n Ew. 
(6.8) 


If , satisfies (6.8), then it will satisfy (i) and (ii); namely, we will have 
(0) = f(h [ 0) = f(@) =a, 

(1) = f(A T 1) = FL, 

h(2) = f(r [ 2) = F({(0, a) , (1, h(1))}), 

h(3) = f(h 13) = F({(0, a) , (1, h(1)) , (2, 2(2))})- 

Our proof of the Recursion Theorem 4.2.1 can be adapted to show that such a 
function / actually does exist; however, rather than do this, we will pursue a 


much more general result. In fact, we will establish two such generalizations, the 
first of which will imply that there exists a function / satisfying (6.8) (see 


Corollary 6.2.2). 


6.2.1 Using a Set Function 


Let = be a well-ordering on W and let A bea set. Recall the initial segment 
functions s and S that were defined on page 143. Theorem 2.1.3 implies that 
the following sets exist: 


“WA = {q : for some u € W, g is a function from s(u) to A}, 


*WA = {gq : for some u € W, g is a function from s(u) to A}. 


Our next theorem is an extension of the finite recursion theorem on w. The 
proof of this theorem uses a method that will be applied again in Section 6.2.2. 
We shall be using the expression set function only to emphasize that a given 
function is a set. 


Transfinite Recursion Theorem 6.2.1. Let < be a well-ordering on a set |’. If 
F: “WA. A is a set function, then there exists a unique function 
H: W — A such that 


H(u) = F(A | s(u)), for allu € W. 
(6.9) 


Proof. Let < be a well-ordering on WV and let pr: ~W4 _, 4 be a function. 
When uw € W, we will say that g: s(w) — A is F’-recursive up to wu if and 
only if for all a € s(u), we have that g(a) = F'(g | s(a)). 


Claim 1. Let g: s(w) + A and h: s(u) + A be F’-recursive up to uw . Then 
g=h. 


Proof. Suppose that g # h, and let v € s(u) be the <-least element such that 
g(v) # h(w). Thus, g(x) = h(a) forall r= v.So(a) g] s(v) =h | s(v). 


g(v) = F(g | s(v)) as g is F-recursive up to u and v € s(u) 
= F(h | s(v)) by (4) 


Hence, = h(v) as h is F-recursive up to u and v € s(u). 


We conclude that g(v) = h(v), a contradiction. (Claim1) UO 


Claim 2. Let uw € W and let g: s(u) + A be /-recursive up to uw. Then for 
all v < u, the function g | s(v) is F -recursive up to v. 


Proof. Let v < uw and let h = g | s(v). Hence, h: s(v) + A and 


h(y) = g(y) for all y € s(v). 
(6.10) 


For each we€s(v), we have that #w<wv-~<u, and _ therefore, 
h(x) = g(x) by (6.10), as x € s(v) 

= F(g | s(a)) as g is F-recursive up to u and z € s(u) 

= F(h | s(x)) _ by (6.10), as s(x) € s(v). 


So h(x) = F(h | s(a)), and thus, g | s(v) is F’-recursive up to v. (Claim 2) 
O 


Consider the relation G C WW x “A defined by 


(x,g) € G if and only if g: s(a) — A is F-recursive up to 2. 
(6.11) 


Claim 1 implies that G is a function. For each u € dom(G), let g, = G(u). 
Therefore, we have that the function g,,: s(w) —- A is F’-recursive up to wu 
whenever u € dom(G’). In particular, g,(a) € A forall x € s(u). 


Claim 3. Let vedom(G). If xes(v), then x €dom(G) and 
Out) =Ge( 2). 


Proof. Let v € dom(G) and x ~ v. Hence, g,: s(v) 4 A is F’-recursive up 
to vw. Claim 2 implies that g, | s(a) is /’-recursive up to x. Therefore, 
x € dom(G) and g, | s(x) = gy by Claim 1. So g,(a) = gz(a) as x € s(x) 
(Claim 3) O 


Claim 4. dom(G) = W. 


Proof. Clearly, dom(G) CW. Suppose, to the contrary, that 
W \ dom(G) # @. Let uc W be the <-least such that u ¢ dom(G). So 
v € dom(G) for all v ~ uw. Let h: s(u) - A be the function defined by (@ ) 
h(a) = gz(a) for all «€s(u), and let v <u. Claim 3 implies that 
Jv(x) = gy(x) for all & € s(v). Thus, g,(a) = h(a) for all x € s(v), and 
therefore, (W) h | s(v) = gy | s(v). Define h: s(u) + A by 


is) = es ifz=u: 


h(a), ifa~<u. 
(6.12) 


We will show that h is F’-recursive up to wu. Let vw € s(w). There are two 


cases to consider: vu~<wuw and v=u. If vu~<wu, we can prove that 
h(v) = F(h } s(v)) as follows: 


h(v) = h(wv) by (6.12), as v < u 
= gy(v) by the definition (@) of h 
F(g, | s(v)) as gy is F-recursive up to v 
= F(h} s(v)) by (¥) 
= F(h | s(v)) _ by definition (6.12) of h, because v ~ u. 


When v = w, we need to show that h(w) = F'(h | s(u)). The definition of h 
in (6.12) implies that h|s(u)=h and hence, 
h(u) = F(h) by (6.12), asu=wu 


= F(h | s(u)) since h | s(u) = h. 


Thus, h is F’-recursive up to uw. So wu € dom(G), a contradiction.(Claim 4) 
O 


Define H: W—-+A by H(x) = g,(x). The argument used to prove 
Claim 4, involving fh, shows that H(uw) = F(H | s(u)) for all we w. 
Moreover, the proof of Claim 1 adapts to show that HW is unique. (See 
Exercise 1.) (Theorem) L 


The relation € is a well-ordering on w, as established by Theorem 4.4.13. 
Theorem 6.2.5 and (6.5) thereby imply the following corollary. 


Corollary 6.2.2. Let f: ““A— A _ be a function. There exists a unique 
function A: w—+ A such that 


h(n) = f(h | n), for alln € w. 
(6.13) 


Corollary 6.2.2 allows us to prove the converse of Corollary 5.1.11. 


Theorem 6.2.3 (AC). Suppose that 4 is an infinite set. Then there is one-to-one 
function h:w—+ A. 


Proof. Let A be an infinite set. Recall that (see page 146) 
<“ A = {p: for some n € w, p is a function from n to A}. 


Whenever p € ““A and n= dom(p), we observe that ran(p) = p|n| is a 
finite subset of 4, and therefore, A \ ran(p) is infinite by Exercise 12 on 
page 116. Let C = P(A)\ {@}. By Theorem 3.3.24, there is a function 
g:C —-A_ such that g(B)€ B for all BEC. Define f: “A+ A by 
f(p) = g{A \ ran(p)). 


By Corollary 6.2.2, there exists a function /, with domain w such that 


h(n) = f(h | n) = g(A\ h[n]), for all n € w. 
(6.14) 


Thus, A: w —s A, and Exercise 11 on page 123 shows that /, is one-to-one. L] 


Richard Dedekind [3] defined a set to be infinite if and only if it has the same 
cardinality as a proper subset of itself. Such a set is said to be Dedekind-infinite. 
For example, the function f: w— w defined by f(n) = n* is one-to-one and 
shows that w and w\ {0} have the same cardinality. So, according to 
Dedekind, w is an infinite set. Our next result confirms Dedekind’s definition in 
general. 


Corollary 6.2.4 (AC). A set A is infinite if and only if there exists a one-to-one 
function f: A — A thatis not onto A. 


Proof. Let A beset. Assume that A is infinite. Theorem 6.2.3 asserts that there 
is a one-to-one function fh: w —+ A. Define f: A— A by 


h(n*), if « = h(n) for some n E w; 

f(x) - 
¥ if x ¢ ran(h). 

(6:15) 


Since / is one-to-one, it follows that f is a function and that f is one-to-one. 
As h(0) € A and A(0) is not in the range of f , we see that f isnot onto A. 


To prove the converse, suppose f: A — A is a one-to-one function that is 
not onto A . Corollary 5.1.6 implies that A is infinite. LJ 


6.2.2 Using a Class Function 


Let us say that a formula y(g,wu) is functional if Vgs!uy(g, wu); that is, for all 
g there exists a unique wu such that y(g,u). Let y(g, u) be functional formula 
and consider the class 


FE {(g, u): p(g, u)}. 


Because y(g,u) is functional, we can now view F as a class function. Let < 
be a well-ordering ona set )W’. If there were a set 4 such that forall g € “A 
, there isa y € A such that y(g,v), then Theorem 2.1.3 would imply that the 
following function #: ~““4 _, 4 is aset: 


PF ; {(g, v) :g E€ <WA VE A. and yg, v)}. 


Theorem 6.2.1 would then yield the existence of a function 7: WW’ —+ A so that 


H(u) = F(H | s(u)) for allue W 


(where the initial segment s(w) is defined on page 143). Thus, the set function 
H would satisfy y(H | s(u), H(u)) for each u € W. 


Can one prove the existence of this function #/ without a set such as 4 ? Yes, 


and this will be verified by our next theorem, whose proof is based on the 
argument used to prove Theorem 6.2.1. This theorem also validates an essential 
general tool in set theory, namely, definition by transfinite recursion. 


Transfinite Recursion Theorem 6.2.5 (Class Form). Suppose that (a, y) is a 
functional formula (which may contain parameters). If = is a well-ordering on a 
set }{/, then there exists a unique function /{ with domain | such that 


y(H | s(u), H(u)) for allue W. 
(6.16) 


The crucial ingredient in the proof of Theorem 6.2.1 was the existence of the 
set function G (see (6.11)). In our proof of Theorem 6.2.5, using the functional 
formula y(a,y) and the replacement axiom, we will be able to get a function 
such as G. Recall that the replacement axiom asserts that for any set A, if for 
each x € A, there is an element ¥Y that is “uniquely connected” to x , then we 
can replace every x € A with its unique connection ¥, and the result is a new 
set. 


Replacement Axiom. Let y)(,y) be a formula. For every set A , if for each 
x € A there is a unique ¥Y such that u(a,y), then there is a set § that consists 
of all the elements ¥ such that (a, y) forsome 7 € A. 


Thus, if Vas!lyw(a,y), then for each set 4, we have the following set: 


S={y:dr(rxeAAy(z,y))}.- 
Hence, f = {(a,y) € Ax S: u(a,y)} isaset function f: A+ S. 


We are now ready to prove Theorem 6.2.5. In a loose sense, our proof will be 
the result of replacing all of the expressions F(a) = y and y= F(x), 
appearing in the proof of Theorem 6.2.1, with the expression y(a, y). We could 


just ask the reader to do this as an exercise, but since the proof requires the 
replacement axiom, we shall provide a complete proof. 


Proof (of Theorem 6.2.5). Let < be a well-ordering on Jf’. Suppose that 
y(a,y) is functional. When uw € W, we will say that a function g is Y- 
recursive up to wu if and only if dom(g)=s(uw) and 


y(g | s(x), g(a)), for all x € s(u). 


We now establish four claims. 


Claim 1. Let 9 and ), be ¥-recursive up to uw. Then g=/h. 


Proof. Let g and fh be ¥-recursive up to uw. Suppose that g #/h. Let 
v € s(u) be the <-least element such that g(v) # h(v). Thus, g(x) = h(x) 
for all a <u. Hence, (A) g | s(v) =h | s(w). Since g and h are ¥- 
recursive up to wu, we have that y(g | s(v),g(v)) and y(h | s(v), h(v)). 
Since ¥ is functional, (a) implies that g(v) = h(wv), a contradiction. (Claim 1) 


LO 


Claim 2. If w<€ W and g is Y-recursive up to w, then for all wv ~ w, the 
function g | s(w) is ¥ -recursive up to v. 


Proof. Let weW and g be wy-recursive up to uw. Let v <u and 
h=g{ s(v).So 


h(y) = g(y) for all y € s(v). 
(6.17) 


Let x € s(v). Therefore, 2 < v <u and s(x) C s(v). Hence, h(x) = g(x) 
and h | s(x) =g | s(x), by (6.17). As g is Y-recursive up to w and 


x € s(u), we have that y(g | s(x), g(a)). Since h | s(x) = g | s(a) and 
h(a) = g(a), we conclude that y(h | s(x), h(x)). Thus, g | s(v) is 9- 
recursive up to v.(Claim2) OU 


Let wa, g) represent the property 
“g is a function that is y-recursive up to x.” 


Let A= {a €W: there exists a g such that w(a,g)}. Claim 1 implies 
that for all g € A there is a unique J such that w(a2,g). Thus, by the 
replacement axiom we have that S = {g : da(a € AA w(a,g))} is aset. Now 
define the relation G C W x S by 


(x, g) € G if and only if g is y-recursive up to x. 
(6.18) 


Claim 1 implies that G is a function. For each wu € dom(G), let g, = G(u), 
and hence, Ju is  -recursive up to wv. 


Claim 3. Let vé€dom(G). If we s(v), then #2 €dom(G) and 
gua) = gel(z). 


Proof. Let v € dom(G) and x < v. So Gy is Y -recursive up to v. Claim 2 
implies that g,, | s(a) is -recursive up to x. Thus, 2 € dom(G) and 
Jv | s(@) = gz by Claim 1. Since x € s(x), we conclude that g,(a) = g,(x). 
(Claim 3) O 


Claim 4. dom(G) = W. 


Proof. Clearly, dom(G)CW. Suppose, to the contrary, that 
W \ dom(G) 4 @. Let u€ W be the <-least such that u ¢ dom(G). So 


v € dom(G) for all v <~ uw. Let fh be the function with domain s(w) defined 
by (@) A(w)=ge(a), and let vu~<u. Claim 3. implies _ that 
Gv(x) = g(x) = A(x) for all x € s(v). Therefore, (W) h | s(v) = gy [| s(v). 
Let Y be the unique element satisfying y(h,y). Define the function h with 
domain s(w) by 


y, a= 
h(z) * fast 


h(x), ifa~u. 
(6.19) 


We will show that h is Y-recursive up to wu. Let v € s(w). There are two 
cases to consider: v ~ u and v= uw. If v ~ u, then we have the following 
equivalences: 


p(h | s(v), h(v)) iff ph | s(v), h(v)) by (6.19), as v ~ u 
iff (gu | s(v), gu(v)) by (¥) and the definition (@) of h. 


Since Jv is -recursive up to v, we have that y(g,, | s(w), g,(v)) holds, and 
thus its equivalent y(h | s(wv),h(w)) also holds. 


When v = U, we need to show that y(h | s(w), h(w)) holds. The definition 
of h in (6.19) implies that h | s(w) =h, and therefore the following are 
equivalent: 


yp(h | s(u), h(u)) iff p(h,h(u)) because h | s(u) =h 
iff y(h, y) as h(u) = y, by (6.19). 
Since y was chosen to satisfy ~(h, y), we conclude that y(h | s(w), h(w)) is 


true. Thus, f, is ¥ -recursive up to U.So u € dom(G), a contradiction. (Claim 
4. oO 


Now define the function ff with domain Wy by H(x)=g,(a). The 
argument in the proof of Claim 4, involving },, shows that y(H | s(w), H(u)) 


for all ~% € W. Moreover, the proof of Claim 1 adapts to show that #7 is 
unique. (Theorem) L] 


Remark 6.2.6. Let < be a well-ordering on WW. In Theorem 6.2.5, let (a, y) 
be a formula that contains parameters (free variables other than x and ¥). The 
function #7 given in Theorem 6.2.5 depends on the values that are assigned to 
these parameters. To illustrate this, suppose that p is a parameter appearing in 
y(x,y). Because p is a free variable in Y, we can express y(x,y) as 
y(p, x,y). Now let 7 be a set that can be assigned to the parameter P so that 
y(y,x,y) is functional. Theorem 6.2.5 implies that there is a unique function 
H1 such that y(y, H, | s(u), H(u)), for all uw € W.So H, depends on ¥, 
and Theorem 6.2.5 holds whenever the formula y(a,y) contains fixed sets, 
because these sets can be viewed as values that have been assigned to parameters 
(see page 25). 


Since the relation € is a well-ordering of w, Theorem 6.2.5 and (6.5) (see 
page 144) imply the next corollary. 


Corollary 6.2.7. Let yo(a,y) be a formula that is functional. Then there exists a 
unique function /’ with domain w such that 


Y(F | n, F(n)), for alin € w. 
(6.20) 


Let F’ be as in Corollary 6.2.7. Because Ff’ | 0 = ©, (6.20) implies that 
F(0) equals the unique ¥ satisfying y(@,y). Since F’ | 1 = {(0, F(0))}, 
we see that F'(1) is the unique ¥ for which y({(0,F'(0))},y) holds, and 
F’(2) equals the unique ¥ satisfying y({(0, F(0)), (1, F(1))},y), etc. We 
now apply Corollary 6.2.7 to show that every set is a subset of a transitive set. 


Theorem 6.2.8. For every set A, there exists a transitive set 4 such that 


Proof. Let A bea set. Now consider the following definition by cases: 


A, if f = 2; 
y Uf(k), if f is a function with domain k* for some k € w; 


S, if none of the above hold. 
(6.21) 


Let y(f,y) be a formula that expresses (6.21). Clearly, for each f, there is a 
unique Y so that y(f,y). By Corollary 6.2.7, there is a function /’ with 
domain w such that y(F | m, F(m)) for all m € w. Thus, for each m Ew, 
we have that 


A, if m = 0; 
Fm) = ifm ! 


UF (n), ifm =n* for some n € w. 
(6.22) 


Let A=U,-, F(n). We see that F(0) = A , by (6.22). So A C Une, F(n) 
. Thus, 4 C A. To show that 4 is a transitive set, let « € y and y € A.We 
show that 7 € 4. Because y € A,we have y € F(n) for some n € w. Thus, 
yCUF(n). Since F(n*)=UF(n), we see that « € F(n*). Hence, 
z € A and J istransitive. LJ 


Let 4, 4, and F be as in the statement and the proof of Theorem 6.2.8. By 
evaluating F(n) for m=0,1,2,..., one can _ verify that 


A=Au|JAul JU AuUU4u-:-. 


Definition 6.2.9. Let 4 be a set. Then the set 4 constructed in the proof of 


Theorem 6.2.8 is called the transitive closure of 4 . 


Exercise 6 shows that 4 , the transitive closure of a set A, is the “smallest” 
transitive set of which 4 is a subset. 


Exercises 6.2 


*4. Complete the proof of Theorem 6.2.1 by showing that the function }] satisfies 
H (w) = F( H | s(u)), for all a, € VY . Also show that /7 is the only 
such function. 


2. Show that Theorem 6.2.5 implies Theorem 6.2.1. 
3. Let = be a well-ordering on a set Jf’ and A_ be a set. Define 


0. *WA 5 Ww by £(g) equals the =<-least @ in \{/_ such that 
u¢dom(g). So if ge “WA and g: s(u) 4 A, then 
&(g) =u. Let G: “WA x W -s A be a function. Show that 


Theorem 6.2.1 implies there exists a function ff: WW“ —s A_ such that 


H(u) = G(H |} s(u),u), for all we W. 


4. Suppose that =< is a well-ordering ona set {7 and fi’. “WA _, A isa function. 
Let H: W — A beas in Theorem 6.2.1. 


(a) Let w% and y bein J} . Prove that if s( u) s(v) , then U“ = Uv. 


(b) Suppose that fh’ is one-to-one. Prove that /J is also one-to-one. 


5. Let = bea well-ordering on |{/ and <* bea well-ordering on A . Assume for each 
g€ “WA, there is a y € A such that g(a) <* y for all x € dom(g)- 
Show that there is an Jf: WY —s A so that for all » € Wand yw € W, if 
v ~< U, then H(v) ~<* H(u). 

*6. Let 4 be a set. Suppose that 4 C C’ where (’ is a transitive set. Prove that 
A C C’ . Thus, A is the “smallest” transitive set for which 4 is a subset. 

7. Let #8 be aset. Conclude from Exercise 6 that {B} is the smallest transitive set that 
contains #3 as an element. 

8. Let 4 be aset. Using the replacement axiom, one can show that 12 a a A} isa 
set. Establish the following: _ 

(a) Prove thatifg,€ A ,thn®C A. 

(b) Prove that AULJ{%: a € A} CA. 

(c) Prove that A UJ U{z a a A} is a transitive set. 

(d) Provethat AULJ{@:a€ A} =A. 


Pd 


9. Let A bea set. Using the proof of Theorem 6.2.8 as a model, prove that 
there is a function /’ with domain w such that 
1. F(O)=A,2. F(n*)={F(n)} foralnew. 


Let C = F|w]. Prove that 4 € © and forall x ,if xy eC,then {X} EC. 


10. Let A bea set. Using the proof of Theorem 6.2.8 as a model, prove that 
there is a function / with domain w such that 
1. F(O)=A,2. F(nt)=P(F(n)) forall new. 


Let C = F[w]. Prove that A € C and forall x ,if x eC, then P(X) EC. 


Exercise Notes: For Exercise 6, let /’ be as in the proof of Theorem 6.2.8, and 
prove that (Vn € w)(F(n) C C). For Exercise 8, use Exercise 6. 


7 
The Axiom of Choice (Revisited) 


In Chapter 3 (see page 66), we first introduced the following principle, which is 
applied in many areas of mathematics. 


Axiom of Choice. Let (A;:i¢ J) be an indexed function with nonempty 
terms. Then there is an indexed function (a; : i € /) such that x; € Aj, for all 


rel. 


The above function (a; :i€ J) , where x; € A; for all j € J, is called a 
choice function for (A; : i € J) . The axiom of choice is used to prove many 
theorems in mathematics. In real analysis, for example, the axiom of choice is 
(tacitly) applied to prove that a real valued function f: R — R is continuous at 
a point c € R if and only if Jim f(tn) = f(©) for every sequence (2,,) that 
converges to c. It is thus said that continuity is equivalent to sequential 
continuity. The axiom of choice is also used to prove that every vector space has 
a basis. 


In Chapter 3, it was shown that the axiom of choice implies the following 
theorem, which asserts the existence of another choice function // . 


Theorem 3.3.24 (AC). Let @ be a set of nonempty sets. Then there is a function 
H:C—+C suchthat H(A) €A forall AEC. 


It was then left as an exercise to show that Theorem 3.3.24 implies the axiom 
of choice. Hence, the theorem is actually equivalent to the axiom. Moreover, the 


axiom of choice is equivalent to a number of seemingly unrelated theorems. In 
this chapter, we will show that the axiom implies two other results that are also 
equivalent to it; but before we do this, we mention an application of the axiom of 
choice to topology, which concerns “product sets.” 


Let (B;:i¢€ J) be an indexed function with nonempty terms. The product 
set, denoted by [|,;-, B;, is defined by 


[] 8 = {Ff : f is a function, dom(f) = I, and (Vi € 1)(f(i) € Bi}. 
ic] 


(7.1) 


Typically, when / is an infinite set, the only way to conclude that this product 
set is nonempty is to appeal to the axiom of choice as follows: Let (a; : i € /) 
be a choice function such that x; € B; for all j € J. Define f: J + U;<; Bi 
by f(i) = a;. Then f € [[;~, Bi, and thus, the product is nonempty. 


Product sets are used to create a variety of topological spaces and to prove 
Tychonoff’s compactness theorem, a classic result in topology. In Chapter 9 (see 
Exercise 19 on page 226), one is asked to complete a proof of a theorem on 
product sets due to Julius K6nig. 


Many important consequences of the axiom of choice can be proved from the 
following weaker version. 


Countable Axiom of Choice. Suppose that (A; : i € /) is an indexed function 
with nonempty terms. If J is countable, then there is a function (2; : i € /) 
such that x; € A;, forall jE 7. 


Thus, if @ is a countable set of nonempty sets, then the countable axiom of 
choice implies there is a function H: C —|JC such that H(A) € A for all 
A €C. The axiom of choice, of course, implies the countable axiom of choice, 
but not conversely. 


Finally, we remark that the replacement axiom shall not be applied in this 
chapter; however, it will be used in Chapters 8 and 9. 


7.1 Zorn’s Lemma 


Zorn’s Lemma, also known as the maximum principle, is an important theorem 
about partially ordered sets that is normally used to prove the existence of a 
mathematical object when it cannot be explicitly produced. According to Max 
Zorn, he first formulated his principle at Hamburg in 1933. Zor published this 
maximum principle in 1935. Amil Artin then used Zorn’s principle to establish 
theorems in algebra. Artin later proved that the principle implies the axiom of 
choice. 


In 1935, Zorn proposed adding his maximum principle to ZF, the axioms of 
set theory. Although Zorn was not the first to suggest such a principle, he 
demonstrated how useful his formulation was in applications, particularly in 
topology, abstract algebra, and real analysis. Zorn also asserted (but did not 
prove) that his lemma and the axiom of choice are equivalent. To review the 
concepts of a maximal element and a chain, see Definitions 3.4.6 and 3.4.14. 


Zorn’s Lemma 7.1.1 (AC). Let (A, <) be a partially ordered set. If every chain 
in A has an upper bound, then A contains a maximal element. 


Our proof of Zorn’s Lemma 7.1.1 will apply “proof by contradiction”; that is, 
we shall let (A, =) be a partially ordered set in which every chain has an upper 
bound. We will then assume that A has no maximal elements and thereby 
derive a contradiction. The proof will depend on a few lemmas and definitions, 
which we now present. 


Let (A, <) bea partially ordered set and S C A. Recalling Definition 3.4.8, 
an element 4 €A_ is called an upper bound for § when ) satisfies 
(Va € S)(a < b). We will say that 6 is a proper upper bound if, in addition, 
we have thatb¢ S. 


Definition 7.1.2. Let (A, =<) be a partially ordered set in which ¢(* is a chain. Then 


C ip is the set of proper upper bounds for ¢( . 


If (A,<) is a poset and CCA _ is a chain, then 
Cp ={qeEA:c~gq, forall cE Ch. 


Thus, C’, is nonempty if and only if C’ has proper upper bounds in A . 


Definition 7.1.3. Let (’ be achain in A , where (A, <) is a partially ordered set. We 


shall say that ( is cofinal in ©’ when C”’ C C and for all ¢ € C,, there is a 
c’ € C! such that ¢ < ¢’. 


Lemma 7.1.4. Suppose that (A, <) is a poset and let C C A bea chain. If ©” 
is cofinal in @, then C), = Cp. 


Lemma 7.1.5. Let (A, <) be a partially ordered set in which every chain has an 
upper bound. If 4 has no maximal elements, then every chain C’ in A has a 
proper upper bound, and hence, C’, is nonempty. 


Proof. Assume that 4 has no maximal elements. Let ( be a chain in A . Since 
every chain in A has an upper bound, let ) € A be an upper bound for @’. If 
b ¢ C’, then f is a proper upper bound for ©. Suppose } € C’. So, because / 
is not a maximal element, there is a g € A_ such that b ~ q. Thus, g ¢ C and 
q is a proper upper bound for C . LJ 


We now present a proof of Zorn’s Lemma. 


Proof (of Zorn’s Lemma 7.1.1). Let (A, <) be a partially ordered set in which 
every chain C’C A has an upper bound in 4 . We will prove that there is an 
m € A suchthat m is a maximal element. Assume, for a contradiction, that 4 
has no maximal elements. By Lemma 7.1.5 and the axiom of choice, there is a 
function f such that f(C,) €C, whenever © is a chain in 4. As 
CpNC = S , we see that 


f(Cp) € C for all chains C in A. 
(7.2) 


A chain {, in A shall be called a ladder if whenever CC, and 
Crhn kL #0, then f(€ p) is the smallest element in C,ML (see 
Definition 3.4.11 on page 73). In other words, if C C L and © has a proper 
upper bound in J, , then f(C,) € L and f(C,) is the smallest element in [, 
that is a proper upper bound for C’. 


Ladders exist. To confirm this, note that @ is (vacuously) a ladder and that 
@, = A. For another example, let a = f(@,). Since {a} is a chain, we have 
that a < f({a},). One can show that the set L = {a, f({a},)} is also a 
ladder. 


Claim 1. Suppose , C kK where J, and fk are ladders. Let ¢¢€ J, and 
kek.lfk<£@,thnkeL. 


Proof. Let 7, and A are as in the claim. Let ¢ € J, and k € fk . Assume that 
k < . Suppose, for a contradiction, that k ¢ L.. Thus, k ~@. Let C be 
defined by 


C={reLl:2 ~<k}. 
(7.3) 


So, C CL. Since L C K, we also have that C C Kk. Clearly, ke kK isa 
proper upper bound for C’ . Because f/x is a ladder, it follows that 


f(Cp) € K and f(C,) <k. 
(7.4) 


Now, because & ~ ¢ and ¢ € J,, the set @ also has a proper upper bound in 

L. Thus, f(C,)€L. Since KEL and f(C,)€L, (7.4) implies that 

f(Cp) <k. Hence, f(C,) € C by (7.3), and this contradicts (7.2). (Claim 1) 
LJ 


Claim 2. Suppose that 7, and x are ladders. Then either } C K or kK CL. 


Proof. Let {, and Kk be ladders. Assume, for a contradiction, that 1 Z AK and 
K Z L.Hence, there isan x € J, sothataz ¢€ K anday€K so that y ¢ L 
. Let 


C={zEeL:z<2r}n{zEeK:z xy}. 
(7.5) 


Clearly, CC L and C C k. Because C C J, and x € J, is a proper upper 
bound for ©, it follows that f(C,)€L and f(C,) <x. Similarly, as 
CCK and y€ K isa proper upper bound for C, we see that f(C,) € K 
and f(Cc p) <4. So, in summary, we have 
(l)adé K, f(Cp) <2, f(Cp)€K; and (2) y¥¢L, f(C,) xy, f(Cp) € L. 


So f(Cp) <a by (1), and f(C,) ~y by (2). Since f(C,) €L and 
f(Cp) € K, we conclude from (7.5) that f(C,) € C, contradicting (7.2). 
(Claim 2) O 


Now let B= {L € P(A): Lisa ladder} . By the power set and subset 
axioms, we see that is aset. Therefore, 8 is the set of all ladders. 


Claim 3. _) 6 is a ladder. 


Proof. To see that |) 8 is a chain, let x and ¥ be in |)8. Hence, x € J, for 
some [, € 6, and y€ K forsome Kk ¢€ B. By Claim 2, either L C K or 
Kk C L..So, we can assume that 2 and y are in J,. As J, is a chain, either 
“x <y or y < x. Thus, | JB isa chain. Suppose C C |) B has a proper upper 
bound in ()B, say, }. We shall show that f(C,) € JB and f(C,) <b. 
Since h € U B, we have that h € J, forsome J, € B. We will now show that 
CCL.LeteEeC.So, e<6. Because CC U B , it follows that ¢ € K for 
some k € B.By Claim 2, either L CK or KCL. A CL,thenceL. 
If L C K, then (as ¢ ~ and } € L, ) Claim 1 implies that c € L,. Hence, 
CCL, and }¢€ J, is a proper upper bound for C’. Since f, is a ladder, 
f(Cp) € L and f(C,) <b. Consequently, f(C,) EJB and f(Cp) <b. 
Therefore, (_) 8 is a ladder. (Claim 3) OU 


Let £=\JB. As B is the set of all ladders, (@) LC £ whenever J, is a 
ladder. Claim 3 states that £ is a ladder. Since £ is a chain, we have by 
assumption that £p is nonempty. Thus, f(£p) € £p. Let £* = £U {f(£p)}. 


Claim 4. £* is a ladder. 


Proof. Since £ is a chain and x < f(£,) for each x € £, it follows that £* is a 
chain. Let C < £* have a proper upper bound / in £*. Thus, 6 = f(£p) and 
CcCé£. If beg, then f(C,)e£C £* and f(C,p) <b, because ¢£ is a ladder. If 


b € £, then b = f(£,), and there are two cases to consider. 


CASE 1: © is not cofinal in £. Thus, © has a proper upper bound w in £. 
Hence, f(C,) €£ and f(C,) =< wu as £ is a ladder. Therefore, f(C,) € £* and 
f(Cp) <b because u < f(£p) = 5. 


CASE 2: © is cofinal in £. Lemma 7.1.4 implies that f(C,) = /(£,) = 6. So 


we Clearly have that f(C,) < £* and f(C,) <b .(Claim4) O 


Since £* is a ladder, (@) implies that £* c £. As f(£p) € £*, we conclude that 
J (£p) € £, and this contradicts (7.2). (Zorn’s Lemma) L] 


Hence, the axiom of choice implies Zorn’s Lemma 7.1.1. Moreover, one can 
also prove that Zorn’s Lemma implies the axiom of choice (see Exercise 10). A 
different proof of Zorn’s Lemma is summarized in Exercise 14 on page 193; 
however, this alternative proof relies on ordinals and the replacement axiom. 


Corollary 7.1.6 (AC). Suppose that (A, <) is a partially ordered set in which 


every chain has an upper bound. For each q@ € A, there exists a maximal 
element 7 € A suchthat a <m. 


Proof. Let (A, =<) be a poset in which every chain has an upper bound, and let 
a€A. Define B={xEA:axzx} and <xp=<NBxB. Thus, 
(B, <p) is a partially ordered set (see Exercise 14 on page 76) in which every 
chain has an upper bound. By Zorn’s Lemma, 8 has a maximal element 7m . So 
a <m and m isa maximal elementin 4. LI 


7.1.1 Two Applications of Zorn’s Lemma 


Zorn’s Lemma is frequently applied to posets of the form (*,C) where F isa 
set (of sets) and C is the subset relation (see Problem 1 on page 70). In such 
applications, one proceeds as described in Exercise 5. We will now present two 
theorems whose proofs apply Zorn’s Lemma as outlined by Exercise 5. 


The first theorem, often called the comparability theorem, shows that one can 
compare the cardinalities of any two sets (see Definition 5.4.14). First, we shall 
state and prove a relevant lemma. 


Lemma 7.1.7. Let P={hCAx B:h is a one-to-one function}. If 


f € P isa maximal element in the poset (P,C), then either dom(f) = A or 


ran( f)= 28. 


Proof. Let (P,C) be the poset defined in the statement of the lemma, and let f 
be a maximal element in P. Clearly, dom(f) CA and ran(f) C B. We 
shall prove the either dom(f)=A or ran(f)= 8B. Suppose, for a 
contradiction, that dom(f) # A and ran(f) # B.Let a€ A\ dom(f) and 
be B\ ranf. Now let g = fU{(a,b)}. Clearly, g C Ax B. Because 
a ¢ dom(f) and b ¢ ran(f), it follows that g is a one-to-one function. Thus, 
g € P. Since f Cg and f # gq, this contradicts the fact that f is maximal. 
Therefore, either dom(f) = A or ran(f) = B.O 


We can now state and prove the comparability theorem, which is actually 
equivalent to the axiom of choice (see Exercise 23, page 218). 


Theorem 7.1.8 (AC). For any two sets A and B, either |A] <.|B| or 
|B| <c |A| - 


Proof. Let A and B be sets, and let 
P={hCAx B:hisa one-to-one function}. We will show that every 
chain in the poset (P,C) has an upper bound. To do this, suppose that C C P 
is a chain. Thus, for all f €C and g €C, we have that either f C g or 
g C f. Clearly, (JC C A x B. Exercise 11 on page 68 shows that JC is a 
one-to-one function. So | JC € P and f C JC for each f € C. We conclude 
that |)C is an upper bound for ¢ . Hence, by Zorn’s Lemma, the poset (P, C) 
has a maximal element f. Lemma 7.1.7 asserts that either dom(f) = A or 
ran(f) = B.If dom(f) = A, then |A| <,|B| because f: A—- B is one- 
to-one. If ran(f) = B, then |B| <.|A| as f~!: B-— A is one-to-one by 
Corollary 3.3.15 and Theorem 3.2.7. LJ 


Our second theorem, identified as the order-extension principle, shows that 


any partial order on a set can be extended to a total order on the same set. A 
proof of this theorem was first published by Edward Marczewski in 1930. Our 
proof will apply Zorn’s Lemma to a partial order on a set of partial orders. We 
first state and prove a useful lemma, which shows that if a partial order is not a 
total order, then it can be extended to a larger partial order. 


Lemma 7.1.9. If < is a partial order on A that is not a total order on A, then 
there is a partial order <’ on 4 such that =~ C <’ and = # =’. 


Proof. Let < is a partial order on A that is not a total order on A. Since ~ is 
not a total order, there exist q € A and } € A such that va aXxb andbxa 


Let RC Ax A be defined by & = {(t,y) € Ax A: x <a and bx y}. 
Because = is reflexive on A , we see that a = a and b <b}. Thus, (a,b) E R 
.Let <’= <UR.So <C =’. Also, < # <’ as (a,b) € - and (a,b) ¢ = 
by (a ). We will now show that ~<’ isa sartial order on A. Since x¥’= UR 


, we have that 


a <' y if and only if2 <yor (x,y) € R. 
(7.6) 


Let « € A. Since w < x, we see from (7.6) that 7 </ a. So ’ is reflexive 
on 4. To show that =’ is es suppose that w <’ y and y </ a. We 
shall show that © = y. Since x <’ y and y =‘ x, the equivalence in (7.6) 
yields the following four cases ee of which, as we will see, cannot hold): 


1. «x xy and y <x. Since = is antisymmetric, we conclude that ¥ = Y. 

2. «xy and (y,2)ER. So (a) «xy, (8) yxXa, (7) bX<a 
Conditions (@ ) and (3) imply that 2 < a. Hence, (7) implies that 
bh < a, which contradicts (a ). 

3. (x,y) € R and y = x. As in the previous case 2, this is impossible. 

4. (a,y)€R and (y,2) ER. Thus, (a2) axa, (8) by, (9) 
yxa, (6) bx<a. Clearly, (8) and 7) imply that hxa 


contradicting (A). 


Because case 1 is the only possibility, we conclude that <’ is antisymmetric. 
Finally, we now show that ~’ is transitive. Assume x2 </ y and y </ z. We 
must show that a <’ z. As x </ y and y =’ z, the equivalence (7.6) again 
yields four cases to consider: 


1. «x xy and y Xz. Since = is transitive, we conclude that x < z. 
2. «xy and (y,z) € R.Thus,(a)rxy,(8) yx<a,(V¥) bz. As(a 


) and (8 ) imply that 2 < a, we see from (7) that 2 < a and } < z. Hence, 
(z,z) ER. 

3. (a,y)€ R and y<z.So(a)xxa,(8)bx<y,(7) y XZ. Since (8 
) and (7) imply that b < z, we rouelnde from (a) that « <a and bx 
Thus, (2, z) € R. 

4. (x,y) € R and (y,z) € R. Hence, (a)axa,(B)bxy, (VY yxa,( 


6) b < z. Clearly, (3 ) and (7) imply that ) < a, contradicting (A). 


In each of the valid cases, we have either « < z or (a,z) ER. Hn by (7.6), 
x <' z.Hence, <’ isa partial order on 4 ah that < C =’ <Z#/.U 


We can now show that Zorn’s Lemma implies the order-extension principle. 


Theorem 7.1.10 (AC). Let <’ be a partial order on A . Then there exists a total 
order < on A such that <’ C x 


Proof. —— <' be a_ partial order on 4A, ~ and let 
A x A: isa partial order on A}. 


Hence, =<’ € P.. We now show that every chain in the poset (P,C) has an 
upper bound. Let C C P bea chain. Thus, for all 4 € C and < € C, either 
<C < or ¥ CS. Clearly, JC C Ax A. Exercises 17 and 18 on page 76 
imply that |)C is a partial order on A. So (JC € P and ¥ CVC for all 


< €C. Hence, U C is an upper bound for ¢ . Hence, by Corollary 7.1.6, there 
is a maximal element < ¢€ P such that <’ C x. Since =< is maximal, 
Lemma 7.1.9 implies that < is a total orderon 4 . LJ 


Exercises 7.1 


1. Let (A; -4E I) be an indexed function so that A; ae @ ,foralzge ]. 
Without appealing to the axiom of choice, prove that if 'S) ic! A i is countably 
infinite, then there is a function Ad -I- UU ie A ; such that f (i ) c A; ; 
forall z € J. 

2. Let RC Ax B, and let R(x, y) denote (2, y) € F. Now suppose 
that for all gy € A, there isa y € PB such that R(x, y) . Using the axiom 
of choice, show that there is a function f ; A => B such that R( x. f (x)) 
foralz € A. 

3. Let RC A x A. and let R(a, y) denote ie. y) € RF. Suppose that 
A a @ and forall a € A, there isa ye As such that R(a, y) . By 
Exercise 2 and the axiom of choice, there is an f : A - A’ such that 
R(a, f (a)) for all gy € A . Using Theorem 4.2.1, show that there is a 
function },: yy —s A suchthat R (h(n), h(n*)) forall 2 € w. 

4. Prove Lemma 7.1.4. 

*5. Let >. b C) be a partially ordered set where F is a set (of sets). Suppose that 


for every chain © C we have that UU C isin F. 


(a) Let C' C F bea chain. Prove that U (” is an upper bound for (. 
(b) Using Zorn’s Lemma, show that there exists an |)f € F that is 
maximal, that is, /)f is not the proper subset of any 4 € F. 


6. Let ie A-B and 
F={X CA:(f | X): X > Bis one-to-one}. Consider 
the partially ordered set LF P C) ; 


(a) Using Zorn’s Lemma, show that there is an Vf € that is maximal. 


(b) Prove that if Ff is onto #3, then (f | M): X 38 isa 


bijection. 
7. Let (A, <) be a poset. Let 
F ={D € P(A): D isa chain in A}. Consider the partially 


ordered set ee : 


(a) Let (” beachain in F. Show that L) C isachainin A . 
(b) Conclude, via Zorn’s Lemma, that there exists achain J) in A. that is 
maximal, that is, J) is not the proper subset of any other chain in 4 . 


8. Let PP be the set of nonzero natural numbers. Then ( P. |) is a poset where | is 

the divisibility relation on # (see Problem 2 on page 71). Define a partial order 

= on P such that | C < and? & 3. 
9. Using Theorem 7.1.10, show there is a total order = on P(w) such that for all 

x and Y in P(w), if « C y, then x = y. Show that ~ # C on 
“10. Let (A ee ;e] ) be an indexed function with nonempty terms, and let 

F = {g:g is a function, dom(g) C J, and (Vi € dom(g))(g(z) € A;)}. 


Thus, (F ‘ C) is a partially ordered set. Assuming Zorn’s Lemma, prove the axiom of 


choice as follows: 


(a) Let C’ C F bea chain. Prove that U C’ isin F and is an upper bound for 
Cc. 
(b) Conclude, via Zorn’s Lemma, that 7 has a maximal element /) . 


(c) Show that dom(h)=J and (Vi € I)(h(i) € Aj). Thus, 
(h(i) "2ZE I) is a choice function. 


Exercise Notes: The conclusion of Exercise 3 is called the axiom of dependent 
choices (DC). For Exercise 6(a), apply Exercise 5. Exercise 7 shows that Zorn’s 
Lemma implies the Hausdorff maximal principle, which proclaims that every 
partially ordered set contains a maximal chain. This principle is also equivalent 
to the axiom of choice. For Exercise 10(a), see Exercise 11(a) on page 68. 


7.2 Filters and Ultrafilters 


We now introduce the concept of a filter, which has a variety of applications in 
set theory and in topology. Filters were introduced by Henri Cartan in 1937 and 
subsequently used by Bourbaki in the book Topologie générale. 


The motivation for our next definition is to give meaning to the intuitive 
notion of a “large” subset of XY . 


Definition 7.2.1. Let XY be nonempty and let /’ C P( X) . Then /’ isa filter on XY 


if it satisfies the following three conditions: 


(1) @ S Fad X € F. 
QurYe Rad ZePhnY Ze. 
(3) If Y € F and Y C = = X ,thn JZ € F. 


Soif Y € F and F isa filter, then y behaves like a “large” set. 


If Ff isa filter on X and A € F, then X \ A ¢ F. To prove this, suppose 
that X \ A € F’. Thus, by Definition 7.2.1(2), @ = AN(X \ A) € F, which 
contradicts Definition 7.2.1(1). 


Clearly, the singleton set {X} is a filter on any nonempty set X . For 
another example, let § be a nonempty subset of X and _ let 
F ={ACX:S CA}. One can verify (see Exercise 1) that /’ satisfies the 
conditions of Definition 7.2.1. Hence, /’ is a filter on X¥ , and it is called the 
filter on X generated by §. When § isa singleton, then /’ shall be called a 
principal filter. For example, if q@ € X, then the principal filter generated by 
{a} can be expressed by 


F={AC X:a€ A}. 
(7.7) 


Definition 7.2.2. Let }:’ be a filter on ¥ . We say that /’ is a maximal filter if for all 


filters G on X if F CG ,thn P—G. 


If a € X, then the principal filter / generated by {a} is a maximal filter. 


To see why it is maximal, let G bea filteron ¥ sothath CG.Let BEG. 
To prove that B € F,, assume B ¢ F’. Thus, a¢g B and ac X \ B. So 
X\ BEF, and hence, X\ BEG. Since BEG, we conclude from 
Definition 7.2.1(1) that O = BN(X \ B) €G, a contradiction. So Be F 
and #’ = G . On the other hand, for q@ € X and h € X where a # b, one can 
verify that the filter generated by {a,b} is not maximal. 


We now present an example of a nonprincipal filter. Let X be an infinite set. 
Let /’ be defined by F={SCX: X\S is finite}. 


One can show that f-’ is a filter on . For example, to verify 
Definition 7.2.1(2), let S and fly be in F. Since 
X\ (SNOT) =(X \ S)U(X \ T), we conclude that X \ (SMT) is finite, 
as X \ S and X \ 7 are finite. Thus, SM 7 isin Ff’. A set § is said to bea 
cofinite subset of X , if SC X and X \ S is finite. So the filter #” is called 
the filter of all cofinite subsets of X . To show that F’ is not a principal filter, let 
Aé F and aé€ A. Consider the set B = X \ {a}. Then B € F (because 
X \ B= {a} is finite) and a ¢ B. Therefore, / is not generated by any 
singleton. 


Definition 7.2.3. Let J+’ be a filteron ¥ . Then /"’ is said to be an ultrafilter on YX if 


forall Y C X , either Y € F or X \ Yer. 


Let a € X. The principle filter / generated by {a} (see (7.7)) is an 
ultrafilter. So there are principal ultrafilters. Do nonprincipal ultrafilters exist? 
Let .X be an infinite set and let F’ be the filter of all cofinite subsets of ¥ . Our 
goal in this section is to prove a result that implies the existence of an ultrafilter 
U/ such that # Cd. Exercise 9 then implies that 7/ is nonprincipal. Thus, 
nonprincipal ultrafilters do, in fact, exist. 


The following lemma identifies two important properties that an ultrafilter 


possesses. 


Lemma 7.2.4. Let 7/ be an ultrafilter on X . 


(1) Foral AC X and BC X,if AUBEU, then either ACY or 
Beu. 
(2) Let F bea filter on X .If&¢ C F’, then Y/ = F’. 


Proof. The proofs of (1) and (2) are left for Exercises 7 and 8, respectively. L 
Lemma 7.2.4(2) shows that an ultrafilter is maximal. Furthermore, if a filter is 


maximal, then it is an ultrafilter (see Exercise 15). 


Definition 7.2.5. Let and G c P(X) be nonempty. Then ¢; has the finite 


intersection property if whenever 5 ( G' is nonempty and finite, then () S f- QO. 


Lemma 7.2.6. Let X and G C P(X) be nonempty. Suppose that G has the 
finite intersection property. Then there is a filter # on XY suchthat GC F. 


Proof. Let G C P(X). Assume G has the finite intersection property. Define 


F {A E P(X): ()s C A for some nonempty finite S C c} 


Clearly, GC F, @ € F, and X € fF’. Weshall prove that F is a filter; that is, 
we will establish items (2) and (3) of Definition 7.2.1. Let Y € F and Ze F. 
So ()S CY and (\}7 CZ where SCG and 7’ CG are nonempty and 
finite. Thus, ()SN()TCYOZ. Exercise 12 on page 40 implies that 
OSONT =(\(SUT). Hence, (SUT) CYNZ. Since SUT is a 


finite subset of G (see Exercise 15 on page 116), we have that YM Ze F’. 
Thus, (2) of Definition 7.2.1 holds. To prove (3), let Y € F and Y CZCX. 
Since ()S C Y for some nonempty finite S CG, it follows that Z € F. 
Therefore, /’ is a filter. LJ 


Definition 7.2.7. Let ¥ and GC P (X) be nonempty. Then ¢; is closed under 


finite intersections if whenever S (C G’ is nonempty and finite, then () SEG. 


One can prove by induction that a filter is closed under finite intersections, 
and therefore, a filter has the finite intersection property (see Exercise 2). 


Lemma 7.2.8. Let F be a filter on  . Suppose that A C X is such that 
A¢F and X \ A¢F. Then the set F U{A} has the finite intersection 


property. 


Proof. Let /’ be a filter on  , and suppose that 4 C X is such that A ¢ F 
and(a) X \ A ¢ F’. Because F isa filter, (a) implies that AAO. 


Claim. For every Y € F’, wehave that YNAO. 


Proof. Assume, for a contradiction that ¥Y M7 A = @ for some Y ¢€ F’. Since 
YNA=6 and Y C X, it follows that Y C X \ A. Thus, because F’ is a 
filter, we have that X \ A € fF’, which contradicts (a). (Claim) UO 


Let SC F U{A} be finite and nonempty. So either A¢ S or Ac S. If 
A€S, then § C F. As F has the finite intersection property, it follows that 


SAQ.If AE S,thn S=TU{A} forafinite TC F.If T=@, then 
()S =A, and thus, ()}S7@. If T7490, then one can show that 
()S = AN(f)\T). Since F’ is closed under finite intersections, (]7 € F’. 
Hence, AN ((\7) 4 @ by the above claim. Thus, (}S # @ and F U{A} 
has the finite intersection property. (Lemma) LJ 


Lemmas 7.2.6 and 7.2.8, together with Zorn’s Lemma, imply the following 
theorem, which has many applications in mathematics. 


Ultrafilter Theorem 7.2.9 (AC). If / is a filter on .¥ , then there is an 
ultrafilter 24 on X suchthat FP CY. 


Proof. See Exercise 14. LJ 


7.2.1 Ideals 


Filters allowed us to investigate the concept of a “large” subset of a set XY. 
Ideals allow us to pursue the concept of a “small” subset of .¥ . As we will see, 
given a filter on YX , one can construct an ideal on X , and conversely. 


Definition 7.2.10. Let be nonempty and let | C ‘P (X) . We say that J is an ideal 


on X if it satisfies the following three conditions: 


1) @EladX €]. 
Q@tyefadZes,thnyuZszel. 
QurYyEefadZCy,teaZef. 


The singleton {@} is an ideal on any nonempty set ¥ . Let SC X bea 
proper subset and let 


I={ACX:ACS}. 


The set / is an ideal, since it clearly satisfies the conditions of Definition 7.2.10. 
This ideal 7 is called the ideal on X generated by §. 


The following lemma shows that there is an intimate connection between 
filters and ideals. 


Lemma 7.2.11. Let x be nonempty. Then the following hold: 
(1) If F isa filter on X , the X¥p={BeE P(X): X\ Be F} is an ideal 
on X. 


(2) If 7 is an ideal on x , the X; = {AE P(X): X\ AE T} isa filter on 
ye 


Proof. See Exercises 19 and 20. LJ 


Let / and J beas in Lemma 7.2.11. Then X- is called the dual ideal, and 
X, is called the dual filter. It is easy to show that Xx, = F and Xx, =I. 


Definition 7.2.12. Let { be anidealon X . Then / is said to be a prime ideal on Y if 


forall Y C X ,eithe Y € fF or X \ Yel. 


Theorem 7.2.13. Let ¥ be a nonempty set. Then 7/ is an ultrafilter on Y if 
and only if 7, is a prime ideal on xX . 


Proof. See Exercise 21. L 


Finally, let J be an ideal on ¥ . Then / is said to be a maximal ideal if for 
all ideals J, if J C J, then J — Jj. One can prove that a prime ideal is 


maximal (see Exercise 22). 


Exercises 7.2 


“1. Let S C X where § is nonempty. Let {A CASS A} ; 
Prove that J’ isa filteron YX . 

*2. Let f be a filter on  . Prove that f/’ is closed under finite intersections. 
Conclude that /*’ has the finite intersection property. 

3. Let q@ € X and h € X where a of  . Prove that the filter generated by 
{a, b\ is not maximal. 

4. Let f be a filter on a nonempty set YY. Let A € fF’. Prove that 
P(A) .) F isa filteron 4. 

5. Suppose that is an infinite set, and let 4 be a cofinite subset of ¥ . Prove 
the following two items: 


(a) If AC BC X, then PF isacofinite subset of X . 
(b) If (* is acofinite subset of A , then (” is acofinite subset of X . 


6. Let / be the filter of cofinite subsets of WJ . Show that /’ is not an ultrafilter. 

*7, Prove Lemma 7.2.4(2). 

* 8. Prove Lemma 7.2.4(3). 

9. Let ¥ bean infinite set and let /*’ be the filter of all cofinite subsets of .¥ . Suppose 
G isafilteron X suchthat ’ CG. 


(a) Prove that every 4 € G’ is infinite. 
(b) Prove that ({' is nonprincipal. 


(c) (AC) Conclude that there is nonprincipal ultrafilter on X . 


10. Let FF bea nonempty set of filters on _¥ . Prove that () F isafilteron XY. 
11. Let F’ be filteron X andletqa € X . 


(a) Prove that if {a} € F’,thng € A forall AE F. 
(b) Suppose that /’ is an ultrafilter on XY and {a} € fF’. Conclude that F’ is 
the principal filteron generated by {a} : 


12. Let Zf be anultrafilteron X . 


(a) Suppose there is a finite set in 7f . Prove that {a} € UY forsome q en. 
(b) Conclude that if 24 is nonprincipal, then every 8 € 2{ is infinite. 


13. Show that P(X) \Uu {B Ee P(X) oe \ Be u} , whenever 7 is 


an ultrafilter on X . 
* 14, Let PF be a filter on Pd Let 


S={ECP(X): Eis a filter on X}. Note that Ff € S. Consider the 
partially ordered set (S , C) ‘ 


(a) Let C a S bea chain. Prove that L) C é€S and that U C is an upper 
bound for (. 

(b) Show, via Corollary 7.1.6, that S has a maximal element ~f such that 
FCU. 


(c) Using Lemma 7.2.6 and Lemma 7.2.8, prove that 7/ is an ultrafilter. 


“15. Let fF’ be a filter on  . Prove that f’ is an ultrafilter if and only if fr’ is 
maximal. 
16. Let F be a filter on A. Let 


AB ={f: f isa function from A to B} . Define the relation ~ on 


AB by f ~g if and only if {2 € A: f(x) = g(a)} E F 


for all Pg and J in A FB. Prove that ~ is an equivalence relation on A B. 


17. Let Uu be an ultrafilter on A. Let 
AB—={f:f isa function from A to B}. Suppose that =< is a total 
preorder on A. _ Define’ the _ relation =<* on AB by 


f <* g if and only if {x eG A : f(z) =< g(x)} E Uu 


forall f and g in 4p. Prove that <* is total preorder on 4p. 


18. Let 2 be an ultrafilter on A and let f: A— B. Define the set 
us ={Y eP(B): f[Y] eu}. 


Prove that 7/* is an ultrafilter on PB . 


*19. Prove Theorem 7.2.11(1). 

* 20. Prove Theorem 7.2.11(2). 

*21. Prove Theorem 7.2.13. 

* 22. Suppose J isa prime ideal on .Provethat is a maximal ideal. 


Exercise Notes: For Exercise 6, find a set and a set that violate 

Lemma 7.2.4(1). For Exercise 12(a), let. bea finite set such that for all _, if 
is finite then . Using Lemma 7.2.4(1), deduce that is a singleton. For 

Exercise 12(b), use Exercise 11. For Exercise 18, see Exercise 9 on page 68. 


7.3 Well-Ordering Theorem 


The following theorem is due to Ernst Zermelo and states that every set can be 
well-ordered. Therefore, the powerful principle of transfinite recursion can be 
applied to any set. 


Well-Ordering Theorem 7.3.1 (AC). Each set has a well-ordering. 


Proof. Assume the axiom of choice. Let’ beaset. We shall use Zorn’s Lemma 
to prove that there is a well-ordering on .Let _ be the following set: 


(Theorem 2.1.3 implies that is a set). Define the continuation relation on 
by 


(7.8) 


where and __ in(7.8)(ii) are arbitrary. The relationship __ is illustrated by the 
following diagram (7.9) where the order — continues the order _ ; that is, the 
order — can only add new elements that are greater than all of the elements 
ordered by (see Exercise 3): 


(7.9) 


The structure is a partially ordered set (see Exercise 4). Let be a chain. 
Thus, forall and ,either or .Nowlet  betherelation with field 


, which is a subset of 


Claim 1. is a total order on its field. 


Proof. Exercise 18 on page 76 implies that is a partial order on its field. To 
show that isatotal order, let and  beinthefieldof . Thus, and 
forsome and in  .Since isa chain, it follows from (7.8) that either 

or . Without loss of generality, let us assume that .Hence, and .As 

is a total order, we see that either or .Since  , we conclude that either 

or .Therefore, isatotal order.(Claim1) UO 


Claim 2. _ is a well-ordering on its field. 


Proof. Suppose that is nonempty. So let .Hence,( ) forsome .We 


now show that . Since ,weseethat .Assume .So_. Thus, for 
some .Wemust verify that .Since , ,and_ isa chain, we have either 
or 


Case1: .So (7.8)(ii) holds. We know that’ .Since by(_), we have that 
because is reflexive. Thus, and . From (7.8)(ii), we conclude that , 
and so 


CASE 2: _. Item (i) of (7.8) implies that . Because ,weinferthat , and 
therefore, 
So. Thus,( ).  . Since __, the set is nonempty. As is a well- 


ordering, there isa  -least element . Because ,( )impliesthat is the 
-leastelementin) .So isa well-ordering on its field. (Claim 2) LU 


Claim 3. isin  and__ is anupper bound for 


Proof. Claim 2 implies that’ isin  .Let .Weprovethat . Clearly, 
Thus, (i) of (7.8) holds. For (ii), suppose that. and . Since , thereisa 
such that .As is achain, we have either or  . The arguments used in 


case 1 and case 2 of the proof of Claim 2 show that . Therefore, . (Claim 3) 
L] 


By Zorn’s Lemma, there exists a maximal element in  .Hence, isa 
well-ordering onaset and _ forall .Wenowprovethat  . Suppose, fora 
contradiction, that there isan  suchthat .Let  . One can easily show (see 
Exercise 5) that is a well-ordering on andtherefore, . In addition, one 
can also show that __, which contradicts the maximality of .Hence, and 
is a well-ordering on _. (Theorem) 


The proof of Theorem 7.3.1 now shows that the axiom of choice implies that 
every set can be well-ordered. Moreover, in Exercise 1 you are asked to show 
that the well-ordering theorem implies the axiom of choice. Another proof of 
Theorem 7.3.1 is outlined in Exercise 15 on page 194. This alternative proof 
depends on the replacement axiom. 


Exercises 7.3 


1. Let be an indexed function such that forall) . By Theorem 7.3.1, the 


set has a well-ordering| . Define a function such that and. Thus, 

is a choice function. 
2. Let be atotal orderon| .Foreach ,let . Suppose that for every and 
, if is nonempty, then hasa| -least element. Prove that is a well- 


ordering on 


3. Let be a well-ordering on and let be a well-ordering on 
Suppose that (i) and (ii) whenever and), then 


(a) Prove that 
(b) Prove that 
(c) Prove thatif) , then for some 


4. Let be the relation defined on in the proof of Theorem 7.3.1. Prove that 


is a partial ordering on 


5. Let be a well-ordering on a set and let| . Define a relation on 
by 


(a) Show that is well-ordering on 
(b) Prove that where is the continuation relation defined by (7.8) in 
the proof of Theorem 7.3.1. 


6. Suppose that is a well-ordering on and that is a well-ordering on 
Let be the lexicographic ordering on __. Prove that is a well-ordering. 


Exercise Notes: For Exercise 6, see Exercise 19 on page 76. 


8 
Ordinals 


We can easily count the number of elements in a finite set; but can we count the 
number of elements in an infinite set? If a set A is countably infinite, then 
Theorem 5.2.6 suggests that 4 has w many elements. Is w a number? In order 
to count the number of elements in infinite sets, we need to define some new 
numbers that will allow us to count far beyond the natural numbers. These new 
numbers are called ordinals. 


8.1 Ordinal Numbers 


Each natural number n € w is a transitive set, and the set w is also a transitive 
set (see Theorems 4.1.11 and 4.1.13). In addition, Theorem 4.4.13 shows that w 
is well-ordered by the relation €. By extending these concepts, we can easily 
define new counting numbers. For sets « and ¥, we shall write x € y if and 
only if ey orx=y. 


Definition 8.1.1. An ordinal is a transitive set that is also well-ordered by the relation € . 


Thus, the set w is an ordinal, and each natural number is also an ordinal. Let 
a be an ordinal. Then @ is a transitive set, and the relation € is a well- 
ordering on a@. Therefore, in particular, € is a total order on a (recall 
Definitions 6.1.1 and 3.4.3). Hence, Lemma 3.4.5 implies the following result. 


Lemma 8.1.2. Let a be an ordinal. Then forall 2, Y, z ina, 


(a) a géa,(b) if «cy, then y¢ga,(c) if x ey and y€ z, then gE z, 


(d) exactly one of the following holds: € y,or™ =Y,oryEe2. 


An ordinal @ is a transitive set. So if « € y € a, then we can conclude that 
x € a. We Shall use lowercase Greek letters a, 3,-y, 7, v,...to denote ordinals. 


Lemma 8.1.3. The following hold: 


(1) © isan ordinal. 

(2) If @ is an ordinal, thena ga. 

(3) If @ is an ordinal and 7 € @, then 7 is an ordinal. 
(4) If @ is an ordinal, then q* is also an ordinal. 


Proof. (1) This follows vacuously. (2) Suppose @ is an ordinal. If a € a, then 
this would contradict Lemma 8.1.2(a). Therefore, a ¢ a. 


(3) Assume @ is an ordinal and let y € @. We first prove that Y is a 
transitive set. Let (A) 7) € v and let (@) v € y. We prove that 7 € 7. Since 
qa is a transitive set and vy € y € a, we have that vy € a. Thus, ne vea, 
and we conclude that 7) € a. Now, since y € a and 7) € a, Lemma 8.1.2(d) 
implies that either 7 € y, 7) = Y, or y © 7). If 7) = y, then (A) implies that 
y € v. So Lemma 8.1.2(b) implies that v ¢ yy, which contradicts (@). If y € 7 
, then (@) implies that »€7 by Lemma 8.1.2(c). Hence, 7 ¢v_ by 
Lemma 8.1.2(b), contradicting (A). So we must have 7) € 7, and therefore, ‘y is 
a transitive set. Since 7 GC @ and @ is well-ordered by €, it follows that Y is 
also well-ordered by € (see Exercise 5 on page 145). Therefore, Y is an 
ordinal. 


(4) Suppose that @ is an ordinal, and recall that a* = aU {a} . Exercise 2 
on page 87 implies that the successor q+ is a transitive set. Let A C a®* be 
nonempty. If ANa=@, then A={a} and A clearly has an €-least 
element. Suppose AM a # @. Since @ is an ordinal, there isa wu € ANa so 
that u € x for all e € ANa. Because a is the € -largest element in q”™, it 


follows that « € x forall « € A .Consequently, q+ is an ordinal. UJ 


We will show how to construct an ordinal using any well-ordered set. First, 
for each set x, let ran(a) = {d : de((c, d) € w)} , which is a set (see page 45). 
Now let =< be a well-ordering on a set 4. Let y{a,y) be the formula 
y =ran(a). Clearly, for each x, there exists a unique ¥ satisfying y(a,y). 
By Theorem 6.2.5, there is a unique function #{/ with domain A such that 


H(u) = ran(H | s(u)), for all u € A. 
(8.1) 


Recall that s(w) = {a € A: a <u} is the “less than” initial segment up to u 
(see (6.3) on page 143). Since ran(H | s(u)) = {H(a):a~<u}, we can 
rewrite (8.1) as 


H(u) = {H(a): a ~ u}, for all ue A. 
(8.2) 


Suppose that uw is the <-least element in 4, that v is the <-least element in 
A such that w ~< wv, and that w is the <-least element in A so that v =~ w. 
Bia) = {H(e) :2 <u} =o —0, 


H(v) = {HA (z) : 2 <v} = {A (u)} = {9} =—1, 
Then 7 (w) = {H(x) : 2 < w} = {H(u), H(v)} = {9, {S}} = 2. 


Since (), 1, and 2 are ordinals, it appears that the range of // consists of 
ordinals. We now show that every well-ordered set can be used to construct an 
ordinal. 


Theorem 8.1.4. Let = be a well-ordering on a set A, and let FY be the unique 
function with domain A that satisfies (8.2). If 6 =ran(H), then § is an 
ordinal. 


Proof. Let x, 4 , }], and § beas in the statement of the theorem. 


Claim 1. For all ¢ ¢ A, H(x) € H(2). 


Proof. Suppose, for a contradiction, that the set C = {w € A: H(x) € H(x)} 
is nonempty. Let q@¢A_ be the x-least element in @. Hence, (A) 
H(a) € H(a). Thus, by (8.2), H(a) = H(u) for some u <a. So by (a), 
H(u) € H(u). Thus, we C and u ~<a. Therefore, a is not the <-least 
element in @’.(Claim1) OU 


Claim 2. }/ is one-to-one. 


Proof. Let x € A and y€ A. Assume H(x) = H(y). If « # y, then either 
“x~yYy or Y~<, as = is a total order. Suppose « ~ y. So, by (8.2), 
H(a) € H(y). Since H(a) = H(y), we conclude that H(a) € H(a), which 
contradicts Claim 1. Similarly, the relationship y ~< 2 is also impossible. So 
“=¥Y.(Claim2) O 


Claim 3. If 2 and Y arein 4, then x < y if andonlyif H(x) € H(y). 


Proof. Let « and ¥Y be in A. If x~<y, then H(x) € H(y) by (8.2). 
Conversely, if H(a) € H(y), then H(a) = H(u) for some u ~ y, again by 


(8.2). Since 7 is one-to-one, we see that « = u and consequently, 2 ~< y. 
(Claim 3) O 


Claim 4. § is a transitive set. 


Proof. Let a€ 8 and BEd. As d6=ran(H), let x in A be such that 
H(a) = 8 .Sincea€ 8 and H(x) = {H(y): y < x} by (8.2), we conclude 
that a = H(y) for some y = x. Therefore, @ is in the range of #7 and so, 
aeéeé.(Clam4) UO 


Claim 5. The relation € isa total order on 4. 


Proof. First we show that € is a partial order on §. Leta, 3, and Y bein §. 
As 6 =ran(H), let x, ¥, and z bein A sothat H(x) =a, H(y) = 8, and 
H(z) = 7. Clearly, a € a. Suppose that a € 8 and 8 € a. Claims 2 and 3 
imply that « < y and y <x. Thus, = y since < is a partial order. So 
H(a) = H(y) and a= @. Finally, assume a€ 8 and G€y. Then «xy 
and y < z. Hence, x < y. Thus, by Claim 3, H(x) € H(y). Soa€vy. To 
show that € is a total order, note that since < is a total order on A, we have 
that either « = y or y=. Claim 3 implies that either a€ 8 or BEa. 
(Clam5) O 


Claim 6. The relation € is a well-ordering on 6 . 


Proof. Since 6 = ran(#), the function /] is onto §. Let D C 6 be nonempty. 

Therefore, /{~! [D| is a nonempty subset of A. Let a be the ~<-least element 

in H~*|D]. Let a= H(a). Clearly, a € D. Now let 8 € D be arbitrary. 

Because }] is onto 6 ,leth € H~*|{D] besuchthat 8 = H(b). Since a is the 

<-least element in H~'|D], we see that a@ < . Claim 3 now implies that 

H(a) € H(b). Hence, a € 8 and a is the € -least element in /) . (Claim 6) 
C] 


Therefore, § is an ordinal. LJ 


(Theorem) 


Definition 8.1.5. We shall say that (A, <) is a well-ordered structure when = is a 
well-ordering on 4 . [We shall write (A, <) for the ordered pair (A, =<) | 


Definition 8.1.6. Let (A, =<) and ( B. <*) be well-ordered structures. A function 
f: A —- B isan isomorphism from (A, =<) onto (B : <*) if f is one-to-one, 
tj is onto /? , and f satisfies 


x =< y if and only if f(x) <* f(y) 
(8.3) 


for all g@ and ¥Y in A . If such an isomorphism exists, then (A, =<) and ( B, =<*) 
are said to be isomorphic, and we write (A, =) ~ ( B, <* }. 


We now present a lemma about isomorphic well-ordered structures that will 
be used shortly. 


Lemma 8.1.7. If f: A — B is an isomorphism between the two well-ordered 
structures (A, <) and (B, <*), then 


(1) «~<y if and only if f(a) <* f(y), for all 2 and Y in A; (2) 
{f(z):a~<~a}s={ye B:y <* f(a)},foreachaeA. 


Proof. Let f: A — B bean isomorphism between the well-ordered structures 
(A, <) and (B, <*). First, we prove (1). Let 2 and Y be in A, and assume 
that «2 ~<y. Thus, «<y. So by (8.3), f(x) <* f(y). Suppose that 


f(x) = fly). As f is one-to-one, we would then have «= y, which 
contradicts the fact that 2 < y (see Definition 3.4.4). Therefore, f(a) <* f(y). 
To prove the converse, assume that f(a) <* f(y). Thus, 2 =< y by (8.3). If 
x=y, then f(a)= f(y), which contradicts our assumption that 
f(a) <* f(y). Hence, x < y. 


To prove (2), let @ € A andlet z € { f(x): a <a}. Thus, z = f(x) foran 
x€A where x~<a. Hence, z= f(x) <* f(a) by (1), and so 
ze{ye B:y <* f(a)}. Let ze {ye B:y <* f(a)}. So ze B and 
z <* f(a). Since f is onto B, there isan x € A so that z = f(a). Hence 
z= f(x) <* f(a), and (1) implies that x ~a. Thus, z€{f(x):a~<a}. 
So {f(a):a2~<~a}={ye B:y <* f(a)}. O 


Remark 8.1.8. Let (A, <) be a well-ordered structure. Theorem 8.1.4 and its 
proof show that there exists an ordinal 6 anda one-to-one function }]: 4 -s 6 


that is onto § . The proof also shows that & * if and only if H(x) € H(y) 


for all 2 and ¥ in A. Therefore, 7 is an isomorphism from (A, <) onto 
(6,€). 


Definition 8.1.9. If (A, <) is a well-ordered structure, then the order type of (A, <) 


is the unique ordinal § given by Theorem 8.1.4. 


Our next theorem shows that if two well-orderings are isomorphic, then they 
have the same order type. 


Theorem 8.1.10. If (A,<) and (B,<*) are isomorphic well-ordered 


structures, then (A, <) and (8B, <*) have the same order type. 


Proof. Let (A, <) and (BB, <*) be well-ordered structures. Let f: A— B be 
an isomorphism from (A,=<) onto (B,=<*). Now let H: A-+6 and 
H*: B — 6* be as given in the proof of Theorem 8.1.4 where ran(H) = 6 
and ran(H™*) = 6*. We shall prove that the ordinals § and 6§* are equal. 


By (8.2), we have that 
H(u) = {H (az): x < u}, for all ue A. (8.4) 
H*(v) = {H*(y): y ~<* v}, for allv € B. (8.5) 


Claim 1. H(u) = H*(f(u)) foralue A. 


Proof. Assume, to the contrary, that C= {u€ A: H(u) A H*(f(u))} Ao. 
Let a € A be the <-least element in C.So(a) H(a) #4 H*(f(a)), and 


H(z) = H*(f(2x)) for all ¢ ~ a. 


Hence, 
H(a) = {H(x): 2 ~<a} by (8.4) 
= {H"(f(x)): ~<a} _ by (86) 
{H*(y):y <* f(a)} by Lemma 8.1.7(2) 
= H*(f(a)) by (8.5). 


Therefore, H(a) = H*(f(a)), which contradicts (a). (Claim 1) O 


Since f is onto 8, Claim 1 implies that ran(/7) = ran(H*). So § = 6*. 
L] 


Lemma 8.1.11. If @ is an ordinal, then @ is the order type of the well-ordered 
structure (a, €). 


Proof. See Exercise 2. LJ 


Theorem 8.1.12. Let @ and § be ordinals. If (a, €) ~ (8,€), thena= 8. 


Proof. This follows immediately from Theorem 8.1.10 and Lemma 8.1.11. UO 


Theorems 8.1.10 and 8.1.12 imply that every well-ordered set is isomorphic to 
a unique ordinal number, namely, its order type (see Exercise 1). Therefore, our 
next theorem implies that the ordinals can now be used to measure and compare 
the “lengths” of any two well-orderings. 


Theorem 8.1.13. Let a, 3, and Y be ordinals. 


(1) fa€8 and gevy,thnacy. 

(2) Ifa C8,thena€ 8 ora=8. 

(3) Exactly one of the following holds: (i) a € &, (ii) a = 8, or (iii) 
Bea. 


Proof. Let @, 3, and ¥Y be ordinals. We first prove (1). Suppose that a € 8 
and 3 € +. Since 7 is transitive, we see that a € 7. 


To prove (2), suppose aC 8. If af 8, then X = 8\ aq is a nonempty 
subset of 3. Let 7) be the €-least element in ¥ . Because (A) 7) € X , we 
have 7 € G and (@) 7 € a. We will now show that 7) = @. First, we prove 
that 7 Ca. Let “€7. Consequently, vE 8, as nEGB and GB is a 
transitive set. Since / € 7 and 7 is the € -least element in ¥ , we conclude 
that vy € X . Thus, vy € @ and so, 7 Ca. To prove that a C7, let (%) 


v € a. Since a C GB, we see that vy € B.So, as v € @, we conclude that ($) 
v € X . Because yv and 7 are in 3, we have by Lemma 8.1.2(d) that either 
veEn,orv¥=7,orn ev. If’ =7, then p € X by (A), contradicting (@). 
If 7 € v, then (%) implies that 7 € vy € a and thus, 7 € a because a@ isa 
transitive set; but this contradicts (@). Hence, we must have that v € 77. 
Therefore, 7) = @,andsoa€ 8. 


For (3), let 6 = aM 8. We show that § is an ordinal. Let 7 € 6 . So, 7 € a 
and 7» €@. As @ and § are transitive, 7 Ca and 7C@. Thus, 
nCaN8=0.Hence, § is transitive and, in addition, § is well-ordered by 
€ because 6 C @ and aq is well-ordered by €. Therefore, § is an ordinal. 
Since 6 C a and 6 C G, item (2) and Lemma 8.1.3(2) imply that 


(a) either § € a or 6 =a, but not both; (b) either 6 € 9 or 6 = G, but 
not both. 


We will now show that the conjunction “§ € q@ and 6 € 8” is impossible, and 
thus, we must have either (i) a € 3, (ii) a = 8, or (iii) BE a. If both 6€a 
and 6 € § hold, then 6 € aM 8 = 6, and this contradicts Lemma 8.1.3(2). One 
can now easily show that exactly one of (i)—(iii) holds. L 


Corollary 8.1.14. Let a, 3, and ¥ be ordinals. Then 


(1) aE€a. 

(2) Ifa@E€ G6 andSeEa,thna=68. 
(3) Ifa@E€ 8 andBEy,thenaey. 
(4) Eithra€ Gorfea. 


Proof. (1) is clear. For (2), assume that @ € 8 and 8 € a. To prove that 
@ =, suppose to the contrary that a #8. Thus, a€ 8 and Bea. 
Theorem 8.1.13(1) implies that a € a, contradicting Lemma 8.1.3(2). Hence, 
we must have that @ = 3. To establish item (3), suppose a€ 8 and BEY. 
Theorem 8.1.13(1) implies @E€vy. Finally, item (4) follows from 
Theorem 8.1.13(3). U1 


Theorem 8.1.15. Let A and C be sets of ordinals. 
(1) If A is nonempty, then there is an @ € A such that (V3 € A)(a € 8) 


(2) UC isan ordinal. 
(3) ()C is an ordinal whenever C is nonempty. 


Proof. Let A and C be sets of ordinals. To prove item (1), assume that 
A#@.Let \ € A. Corollary 8.1.14(4) implies that 7 € A or A € 7 for all 
nEA. Let S={nEA:nEA}. Since \ES, we have that § is 
nonempty. In addition, 7 €A for all »€S. Thus, $C * by 
Theorem 8.1.13, as \ € \*. Because \* is an ordinal, S contains an € -least 
element a . It now follows that @ is also the € -least element in A. 


We now prove item (2). Since © is a set of ordinals, Lemma 8.1.3(3) implies 
that () C is also a set of ordinals. Corollary 8.1.14 allows us to conclude that € 
is a total order on | JC’. Item (1) now implies that |_JC’ is also well-ordered by 
€. So to show that [JC is an ordinal, we just need to prove that it is a 
transitive set. Let a € @ and let 6 € |)C. Thus, 8 € y foran ordinal y € C 
. Since a € @ and 8 € 4, we see that a € y. Hence, a € Uc . Therefore, 
(JC is an ordinal. For the proof of (3), see Exercise 14. U 


Definition 8.1.16. Let (’ be a set of ordinals. The supremum of (” is the ordinal U Cc 


and is denoted by © . When (" is nonempty, we denote the ordinal () i by mf C. 
which is called the infimum of (” . 


In Definition 8.1.16, the ordinal c is the least upper bound for (’, and jnf C 
is the greatest lower bound for ( (see Exercises 13 and 14, respectively). In 
particular, if @ and § are ordinals, then max(a, 8) = sup{a, 3} =aU@ is 
the larger of the two ordinals. 


We now classify ordinals into three distinct types: A given ordinal is either (), 
a successor ordinal, or a limit ordinal. 


Definition 8.1.17. Let @ be a nonzero ordinal. Then @ is a successor ordinal if 


ray if] " for some ordinal 7). We shall say that Q is a limit ordinal when @ is not a 


successor ordinal. 


Lemma 8.1.18. Let @ be a nonzero ordinal. Then a is a limit ordinal if and 
only if for all \ € q@, we have that \* € q. 


Proof. Let @ #0. Suppose q@ is a limit ordinal. Let \ € q. Exercise 9(b) 
implies that Ay € @. Since @ is not a successor ordinal, we conclude that 
\* € q. For the converse, assume that 


A* € a whenever A € a. 


(8.7) 
Suppose, to the contrary, that @ is a successor ordinal. Thus, 77* = @ for some 
ordinal 7). Since 7 € 7* = a, (8.7) implies that a@ € a, a contradiction. L 


Lemma 8.1.19. If @ is a limit ordinal, then w Ca. 


Proof. Let a bea limit ordinal. By Theorem 8.1.13(3), either @ € w orw € a 
. If a € w, then by Theorem 4.1.6 we conclude that gq — + for some k € w. 
So @ would be a successor ordinal. Thus, we must have w € @. Therefore, by 
transitivity, w Ca. U 


Theorem 8.1.20 (Burali—Forti). There is no set that contains all of the ordinals. 


Proof. Assume, to the contrary, that there is a set 4 in which each ordinal is a 
member. By the subset axiom there exists a set /3 such that 


B= {a€A:aq is an ordinal}. 
(8.8) 


Thus, § is the set of all ordinals. Corollary 8.1.14 allows us to conclude that € 
is a total order on #3 . Theorem 8.1.15(1) implies that 8 is well-ordered by € . 
We now show that # is a transitive set. Let a € 8 and 8 € B. Then 8 isan 
ordinal, and Lemma 8.1.3(3) implies that @ is an ordinal. So q@ € PB by (8.8). 
Hence, # is an ordinal, and thus, Be PB by (8.8), which contradicts 
Lemma 8.1.3(2). L 


Theorem 8.1.21. Let u(€) be any formula. If there is an ordinal € for which 
w(€) holds, then there is a least ordinal @ such that w(a). 


Proof. Assume that € is an ordinal so that w(€) holds. Thus, the set of ordinals 
A={yE&* :u(y)} is nonempty. By Theorem 8.1.15(1), 4 has a least 
element @ . Hence, «@ is the least ordinal satisfying w(a@). UL 


Theorem 8.1.22 (Ordinal Induction). Let ~(€) be a formula and suppose for 
every ordinal 3 we have that 


if (VE € B)wW(E), then (8). 
(8.9) 


Then we can conclude that 7() holds for all ordinals 3. 


Proof. Assume that (8.9) holds for each ordinal 3 . Suppose, for a contradiction, 
that there is an ordinal € for which —y)(€) holds. By Theorem 8.1.21, there is a 
least ordinal a that satisfies —%(a). Since a is the least such ordinal, it 
follows that (VE € a)u(€). Thus, (8.9) implies that ~(a), a contradiction. O 


So to prove that a statement 7(3) holds for every ordinal @ , it is sufficient to 
show that (8.9) holds for each ordinal 6. Such a proof is called proof by 
transfinite induction, and it, typically, has the following structure: 


Let 8 be an ordinal. 
Assume that (VE € 8)w(&). [induction hypothesis| 

Case 1: Assume 8 = 0. 
Prove 7(8). 

Case 2: Assume @ is a successor ordinal. 
Prove (8). 

Case 3: Assume § is a limit ordinal. 
Prove w(8). 

(8.10) 


We shall present several proofs by transfinite induction in the next section (e.g., 
see Theorem 8.2.9). 


Exercises 8.1 


“4. Suppose that (A, <) ; (B, <*), and (C, <i) are all well-ordered 
structures. Prove the following: 
(@) (A, <) = (A, s). 
(b) If (A, <<) = (B, 4°), then (B, *) & (A, =). 
() If (A,<) = (B, <*) and (B, <*) = (C, <*), then 
(A, x) © (C, x). 


Conclude that two well-ordered structures have the same order type if and only if the 
structures are isomorphic. 


*2. Let Q@ be an ordinal, and so (a, €) is a well-ordered structure. By the proof of 
Theorem 8.1.4, there is a function #7 with domain q satisfying 


H(u) = {H(x): 2 € u} for allu ea. 


Show that H(u) = uw forall a € q@ and conclude that ran(H) =a. 


3. Let @ and % be ordinals. Prove that either @ < 8 or B Qa. 

4. Suppose W © @ where @ isan ordinal. Show that @ is an infinite set. 

*5. Let ‘yY be an ordinal and let 8 € 4. Since ‘Y is well-ordered by € , the initial 
segment up to is the set g‘ (8) {x Evy:rE 8} (see (6.3) on page 143). 
Show that s© (8 ) B. 

6. Suppose (A, =<) is a well-ordered structure, and let JY and ¢; be functions with 


H(u) = {H(a): x ~ u}, for all ue A, 


dentin a: sine G(u) = {G(x): a <u}, for allue€ A. 


Prove, without appealing to Theorem 6.2.5, that }] cs 


*7, Let (A, <) and (B, <*) be well-ordered structures with order type § and 
6*, respectively. Suppose that f: A— B is a one-to-one function satisfying 
x =< y if and only if f(a) <* f(y), forall x and y in 4.Let H and }]* be 
as in the proof of Theorem 8.1.10. Thus, 6 = ran(#) and 6* = ran(H"“). 


(a) Prove that H(u) € H*(f(u)) forall we A. 
(b) Prove that ran(#7) C ran(H"*). 
(c) Prove that 6 € 0*. 


“8. Let (A,<) be a well-ordered structure with order type a. Let 7 be the 
order type of (C,<c) where C C A and ~c is the induced relation (see 
Definition 3.4.15). Using Exercise 7, show that y € a. 

*9. Let a and @ be ordinals. 


(a) Prove that there is no ordinal 7 such that q@ € y € a’. 
(b) Suppose that a € {. Prove that a* € @. 
(c) Prove thatifa € 8,thena* € 8”. 


(d) Prove that ifa* = 8*,thena=Q8. 
(e) Prove thatifa€ 8 ,thnanB=a. 


*10. Let @ be anonzero ordinal. 


(a) Prove that a isa limit ordinal iff (YAEa)(Ay)(AE yea). 
(b) Prove that q@ is a limit ordinal if and only if @ = sup @, 


11. Suppose that (A, <) is a well-ordered structure and a € A. Let s(a) C A 
be defined by s(a) = {2 € A: x < a} . Thus, (s(a@), <.(q)) is a well-ordered 
structure (see Exercise 5 on page 145). Let § be the order type of (A, <) and 
let }?: A — 6 be as in the proof of Theorem 8.1.4. Show that 7) is the order 
type of (s(@), Xs¢a)) if and only if H(a) = 7. 

12. Let (A,<) and (C, <*) be well-ordered structures. Let @ be the order 
type of (A, =) and let 7) be the order type of (C, <*). Suppose that 7 € a. 
Prove that there exists an @ € A such that (s(@),<sa)) ~ (C, <*), where 
s(a)={xtEA:2~a}. 

“13. Let § —C where © isa set of ordinals. 


(a) Show that ifq@ecC,thena€ 0. 
(b) Let ¥ be an ordinal. Show that if a € y foralla € OC, then 6 € 7. 


Conclude that 6 is the least upper bound for C’ (see Definition 3.4.9). 


“14. Let C bea nonempty set of ordinals and let 7 be the € -least element in 
C . Prove that y = infC. 
15. Prove that there is no set that contains every successor ordinal. 


Exercise Notes: For Exercise 1(b), assume f is an isomorphism from (<A, =) 
onto (B, <*). Show that f~! is an isomorphism from (B,<*) onto (A, <). 
For Exercise 1(c), let f be an isomorphism from (A, <) onto (B, <*), and let 
g be an isomorphism from (B, <*) onto (C’, <*). Show that the composition 
go f is the desired isomorphism. For Exercise 7(a), the proof of 
Lemma 8.1.7(1) shows that « ~< y if and only if f(a) <* f(y) forall x and y 
in A. Let @e A be the <-least such that H*(f(a)) € H(a). Thus, 
H*(f(a)) = H(a) for some «~<a. For Exercise 8, by Exercise 5 on 


page 145, (C’, <q) is a well-ordered structure. For Exercise 15, if were a set 
containing every successor ordinal, show that |_|) B contains every ordinal. 


8.2 Ordinal Recursion and Class Functions 


The ordinal numbers form an extension of the natural numbers. Even though the 
collection of ordinals is not a set, it will be useful to have some notation for this 
class. We shall let Qn denote the class of ordinal numbers; that is, let 
On = {a@: @ is an ordinal} . 


Let Y be an ordinal. As ¥Y is well-ordered by €, Theorem 6.2.5 and 
Exercise 5 on page 184 immediately imply the following version of transfinite 
recursion in which we explicitly reveal a parameter! P (see Remark 6.2.6). 


Ordinal Recursion Theorem 8.2.1. Let yo({p, x,y) be a formula such that for 
all P and x, there exists a unique ¥Y such that y{p,«,y). If Y is an ordinal, 
then for each ordinal @ there exists a unique function 7, with domain Y such 
that 


y(a, H. | 8, Ha(8)), for all B E+. 


If it is not necessary to reveal a particular parameter, then we have our next 
result that also follows from Theorem 6.2.5. 


Theorem 8.2.2. Suppose that y(a,y) is a functional formula. If y is an 
ordinal, then there exists a unique function /{J with domain 7 such that 


y(H | 8, H(8)), for all BE +. 


An ordinal class function ¥ is one for which there is a formula 4 (a, y) such 
that F = {(8,y):8€ OnAvw(8,y)} is a “class function.” So for all 


ordinals 8 , F(8) = y if and only if (8, y). 


Thus, an ordinal class function has My as its domain. Since the class of ordinals 
is not a set, it thereby follows that ~ is not a set. Hence, we should be 
somewhat cautious when working with #; however, if we keep in mind that F 
is just an abbreviation for a “functional formula on the ordinals,” then we can 
safely view ¥ as if it were a set. For example, let 9 be an ordinal. Now let 
F\|8)| be the collection F[8] = {F(n):7€ 8}. Since 8 is a set, the 
replacement axiom implies that |G] is a set. Hence, by the subset axiom, the 
restricted function F | 3 is alsoaset. So F can always be “approximated” by 
a set function. For these reasons, we shall informally work with F as though it 
were a Set. 


Now let us explore how one can define ordinal class functions by transfinite 


A 


recursion. In Theorem 8.2.1, the function 7, has domain Y and depends on a; 
however, do the values of H,, depend on 7¥ ? In other words, suppose that +‘ is 
such that y € y’. Theorem 8.2.1 implies that there is a function HH’ with 


Does H(A) = H/(A) for each \ € +? The proof of Theorem 6.2.5 shows that 
the answer is “yes.” Thus, we can define an ordinal class function 1, by 
Ha(A) = y if and only if 

(3y)(SH)[A € yA 0(H,y) A (V8 € y)¢(a, H | 8, H(8)) A H(A) = yl 


where J(H,) abbreviates “#7 is a function with domain 7, and ¥ is an 


ordinal.” It therefore follows that the ordinal class function #,, satisfies 
y(a, He | 8,Ha(8)), for all 8 € On. 


We now record the above conclusions in the following theorem. 


Ordinal Class Recursion Theorem 8.2.3. Suppose that y(p, 2, y) is a formula 


such that for all P and 2, there exists a unique ¥ so that y(p, x, y). Then, for 


each ordinal @, one can define an ordinal class function },, such that 
y(a, Ha | B,Ha(B)), for all 8B € On. 


Moreover, there is a formula @(a, A, y) such that for all ordinals a and \ , we 
have Ha(A) = y iff O(a, A, y). 


Theorem 8.2.3 will be used in the next section to define the operations of 
addition and multiplication on the ordinal numbers. Theorem 8.2.3, by adding a 
“dummy” parameter, implies an ordinal class recursion theorem for formulas 
that may, or may not, contain parameters. 


Theorem 8.2.4. Suppose that the formula (a, y) is functional. Then one can 
define an ordinal class function H. such that 


y(H | 8,H(8)), for all 8B € On. 


The following lemma can be useful in certain applications of Theorem 8.2.4. 


Lemma 8.2.5. Let 4 beaset. If F: On — A is an ordinal class function, then 
F is not one-to-one. 


Proof. Let 7: On — A be an ordinal class function where A is a set. Since 
F is definable by a formula, the subset axiom implies that the range of F 


B={2x€A: (da € On)(F(a) =2)} 
is a set. We shall prove that F is not one-to-one. Suppose, to the contrary, that 


F is one-to-one. Let y(a,a) be a formula that - satisfies 
y(a,a) iff (ae BAF(a)=a)V(a¢é BAa=29) 


for all a and a. Since ¥ is one-to-one, it follows that Vas!ay(a,a). As PB is 


a set, the replacement axiom implies that > {a : dala € BA g(a, a))} 


is also a set. Because F: On — A, we see that (Va € On)(Sa € B)y(a,a). 
Thus, contains every ordinal, contradicting Theorem 8.1.20. LJ 


Theorem 8.2.4 and Lemma 8.2.5 can be used, together, to give new proofs of 
Zorn’s Lemma and the well-ordering theorem (see Exercises 14 and 15). These 
proofs, of course, are different from those that were presented in Chapter 7. 


Normal Class Functions 


We shall now investigate certain ordinal class functions whose codomain is the 
class of ordinals. The results obtained in this section will be used in Section 8.3 
where we define addition and multiplication on the ordinals. 


Definition 8.2.6. A class function FF: (yy —s Oy is strictly increasing when for all 


ordinals @ and 3,if @ € §, then F(a) é F(B). 


The class function F: On-—+ On defined by F(a)=a* is strictly 
increasing by Exercise 9(c) on page 185. 


Definition 8.2.7. A class function -: Qn —+ On is continuous provided that 


F(8) sup{F(n) INE B} for all limit ordinals 9 . 


Definition 8.2.8. A class function F: Cy —s On is said to be normal if it is strictly 


increasing and continuous. 


Our next theorem provides a method for showing that a continuous class 
function -: On —- On is strictly increasing. The proof follows the structure 
given in (8.10) on page 184. 


Theorem 8.2.9. Let F: On — On be continuous. If F(y) € F(y*) for every 
ordinal , then F is strictly increasing. 


Proof. Let #: On —+ On be continuous. Assume that (A) F(y) € F(y* ) for 
every ordinal y. Let a be a fixed ordinal. We prove by transfinite induction 
that 


VB(if a € 8, then F(a) € F(8)). 


Let 98 be an- ordinal. Assume the induction hypothesis 
(VE € B)(if a € €, then F(a) € F(€)). (1H) 


We must prove that 


if a € 8, then F(a) € F(8). 
(8.11) 


Case 1: 3 = Q. Then the conditional statement (8.11) is vacuously true. 


CasE 2: @=y* for an ordinal Y. We shall prove (8.11) as follows: 


aeésspaacy because 8 = y~ 


>acy by definition of ~~ 
=> F(a) € F(y) by (IH) 

=> F(a) € F(y) € Fly") by (a) 

=> F(a) € F(8) as B=". 


Case 3: @ is a limit ordinal. So, F(8) = sup{F(€):€ € B} because F is 
continuous. Thus, (¢) F(€) € F(3) for all €€ 8. We prove (8.11) as 
follows: 


aeéBspacates as § is a limit ordinal 
=> F(a) € F(a*) € F(8)_ by (a) and (4) 
=> F(a) € F(8) by transitivity or equality. 


Therefore, (8.11) is true, and the proof is complete. LU 


Definition 8.2.10. Let (’ be a nonempty set of ordinals. Then (” is a limit set if for all 


a € C,thereisa 9 € C suchthta€e 8. 


In particular, a limit ordinal is a limit set (see Exercise 10(a) on page 185). 


Definition 8.2.11. Let J) be a limit set. We say that ( is cofinalin J) if CC D 


and forall q@ € JD thereisa 8 € C suchthatq E€ 8. 


If © is a cofinal subset of a limit set J), then @ is also a limit set. The 


proofs of the following three useful lemmas will be left as exercises (see 
Exercises 3-5). 


Lemma 8.2.12. Suppose that C’ is a limit set. Then 


(1) sup ¢ isa limit ordinal. 
(2) © iscofinal in sup c. 


Lemma 8.2.13. Let F: On — On be normal and let — be a limit set. If © is 
cofinal in J), then F{D] is a limit set and F|C] is cofinal in F{D]}. 


Lemma 8.2.14. Let |) bea limit set. If @ is cofinal in J), then C = sup D. 


Lemma 8.2.15. Suppose that F: On — On is normal. If © is a limit set, then 
F[C] = F (sup C). 


Proof. Let F: On — On be normal, and let © be a limit set. Let y = supC. 
Thus, 7 is a limit ordinal and © is cofinal in y , by Lemma 8.2.12. Therefore, 
Lemma 8.2.13 implies that #[C] is cofinal in #|y|. Consequently, we have that 
sup £[C] = sup F[y] by Lemma 8.2.14. Because F is continuous, we conclude 


that sup F[y] = F(y). Since y=npc, we obtain 
sup F[C] = sup F[y] = F(y) = F (sup C). 


Therefore, sup F[C] = F(sup C). U 


Corollary 8.2.16. Suppose that 7: On — On is normal. If 7 is a limit ordinal, 
then F (+) is also a limit ordinal. 


Let F be anormal class function -: On — On. Then Lemma 8.2.15 shows 


that sup -[C] = F(sup C) whenever ( is a limit set. Thus, 7 “commutes” with 
the supremum of a limit set. Our next theorem shows that ~ commutes with the 
supremum of any nonempty set of ordinals. 


Theorem 8.2.17. If #: On — On is normal, then sup F[C] = F(sup C), that is, 
sup{F(&) : §& ¢ C} = F(sup C) 


for every nonempty set @ of ordinals. 


Proof. Let #: On — On be normal and let © be a nonempty set of ordinals. 
We shall prove that sup -[C] = /(sup C). There are two cases to consider: either 
C is a limit set or it is not. If © is a limit set, then sup -[C] = F(sup C) by 
Lemma 8.2.15. 


Now suppose that C’ is not a limit set. Then © must have a largest element 
. Therefore, , ,and forall .Since is normal, we conclude that 
forall .Thus, .Hence, . UO 


Remark 8.2.18. Whenever is a set, we shall write to mean that for 
some ordinal ;thatis, is an ordinal-valued set function. 


The mathematician Oswald Veblen was the first to investigate normal class 
functions, and our final theorem on normal class functions is due to him. Before 
we state and prove this theorem, we illustrate how one can apply Theorem 8.2.2 
to define a function by ordinal recursion. Such a function will be used in the 
proof of Veblen’s theorem. 


Let be a class function and let be an ordinal. Now consider the 
following definition by cases: 


(8.12) 


Since is definable by a formula, there exists a formula that expresses 
(8.12). Clearly, for each  , there is a unique so that . Theorem 8.2.2 
implies that there is aset function suchthat ,forall  . So for each natural 
number _, we have that 


(8.13) 


Note that since is the empty function, the first condition in (8.12) produces 
the first condition in (8.13). Because is a function with domain _, the second 
condition in (8.12) yields the corresponding condition in (8.13). Thus, isa 
function with domain _ satisfying: 


(1) ,(2) _ , forall 


We will be using the function __ in the following proof of Veblen’s theorem. 


Fixed-Point Theorem 8.2.19. If is a normal class function, then for each 
ordinal , there exists anordinal suchthat and 


Proof. Let} be normal andlet be anordinal. If ,then we are done. If , 
then — by Exercise 9. By Theorem 8.2.2, there isa function from into the 
ordinals such that 


(1) ,(2) _ , for all 


Let .As__ is strictly increasing, we see that isalimitset and is cofinal 

in (see Exercise 11). Thus, (_ ) by Lemma 8.2.14, and (_ ) by 

Lemma 8.2.15. Hence,( )and( )implythat .Letting , we conclude that 
. Moreover, as, Lemma 8.2.12(2) implies that . LI 


Exercises 8.2 


1. Let be a set of ordinals with largest element . Show that 


. Let be strictly increasing. Prove that is one-to-one. 
3. Prove Lemma 8.2.12. 
4. Prove Lemma 8.2.13. 
5. Prove Lemma 8.2.14. 
. Let be a limit set. Using Theorem 4.2.1, define a one-to-one function 
Conclude that is infinite. 


7. Let be normal and let be a limit set. Prove that is also a limit set. 


. Using Exercise 7, prove Corollary 8.2.16. 


9. Suppose that is strictly increasing. Prove that for all ordinals 


10. Suppose and are normal. 


(a) Prove that the composition is normal. 
(b) Show thatif) , then and 


11. Let) ,)| ,and be as in the proof of Theorem 8.2.19. 


(a) Prove that for all, . Conclude that is a limit set. 
(b) Prove that is cofinal in 


12. Let) benormal. Suppose) .Showthatforall ,if | then 


13. Let be normal, and suppose that is a limit set such that the set is 
cofinalin| .Let)  . Prove that 
14. Let be a poset in which every chain has an upper boundin . Let 


. Foreach chain| , let be the set of proper upper bounds for. Let 


By the axiom of choice, there is a function such that forall, . Note that 


whenever _. Let be a formula that asserts the following: 
Hence, for all , there exists a unique such that . Theorem 8.2.4 thus 
implies that there exists a class function such that , forall . Hence, for 


each ordinal 


P 


(a) Using Lemma 8.2.5, show that for some ordinal 


(b) Let be the least ordinal such that) =. Show that 


is a chain with no proper 


upper bound. 
(c) By assumption, the chain has an upper bound)! —. Show that is a maximal 
element. 
15. (AC) Let be nonempty set and let . Let . Note that . By 
Theorem 3.3.24, let be a choice function for . Let be a formula that asserts the 


following: 


So forall, there exists a unique such that, . Theorem 8.2.4 implies that there exists 


a Class function such that. ,forall| . Thus, 


(a) Using Lemma 8.2.5, show that for some ordinal 

(b) Let be the least ordinal such that . Show that the set function is a 
bijection. 

(c) Show that has a well-ordering. 


Exercise Notes: For Exercise 10(a), use Exercise 7. For Exercise 10(b), apply 
Exercise 9. For Exercise 15(c), use (b) and Theorem 6.1.4. Exercise 14 outlines 
another proof of Zorn’s Lemma 7.1.1. Exercise 15 offers an alternative proof of 
Theorem 7.3.1, the well-ordering theorem. 


8.3 Ordinal Arithmetic 


In Chapter 4, we used the recursion theorem to define the arithmetic operations 
of addition and multiplication on the natural numbers. We shall now define such 
operations on the ordinals by transfinite recursion, that is, by applying 
Theorem 8.2.3. We first show how to define ordinal addition. Consider the 
following definition by cases: 


(8.14) 


Let bea formula that expresses (8.14). Hence, forall and_, there existsa 
unique sothat  . Theorem 8.2.3 implies that for each ordinal , there is a 
class function) suchthat ,forall (see Exercise 1). Thus, 


(8.15) 


Since is the empty function, the first condition in (8.15) corresponds to the 
first condition in (8.14). Moreover, for each ordinal , we see that is an 


ordinal-valued set function with domain  . Thus, the last two conditions of 
(8.15) correspond, respectively, to conditions two and three of (8.14). We can 
now define the operation of addition on the ordinal numbers. 


Definition 8.3.1. For all ordinals and _, define to be the unique ordinal __; that is, 


So, using the above definition of ordinal addition, (8.15) yields the following 
rules for the addition of ordinals. , _, and limit ordinal 


The rules (A1)-(A3) extend those given in Theorem 4.3.3 for the addition of 
natural numbers. Some of the properties of addition on the natural numbers also 
extend to ordinal addition, whereas others do not. For example, addition on the 
natural numbers is commutative; however, addition on the ordinals is not 


commutative. To verify this, let us evaluate and . We compute as 
follows: 
Now we determine the value of . First, note that the set is cofinal in 


Now, since _ isa limit ordinal, we obtain the following: 


Because ,weseethat  . Thus, addition of ordinal numbers is not necessarily 
commutative. The following lemma will allow us to prove that ordinal addition 
is associative. 


Lemma 8.3.2. For each ordinal ,theclass function — is normal. 


Proof. See Exercise 9. LJ 


Using the operation of addition on the ordinals, we can define multiplication 


using Theorem 8.2.3. The method used to define addition can be modified? to 
show that for every ordinal _, there exists a class function __ satisfying the 
following: 


(8.16) 


The operation of multiplication on the ordinal numbers can now be defined. 


Definition 8.3.3. Let and be ordinals. Define to be the unique ordinal _; that 


is, 


Definition 8.3.3 and (8.16) yield the following rules for the multiplication of 
ordinals ,_ , and limit ordinal 


Lemma 8.3.4. If is anonzero ordinal, then _is normal. 


Proof. See Exercise 10. LJ 


Our next lemma will be used to prove that ordinal multiplication distributes 
over addition. The proof also illustrates how one can prove that arithmetic 
compositions result in a normal class function. 


Lemma 8.3.5. Let. and __ be ordinals. Then 


(a) The class function definedby _ is normal. 
(b) The class function definedby _ is also normal. 


Proof. Let. be an ordinal. Then 


Hence, .Since and _ are normal, Exercise 10 on page 193 implies that 
is normal. A similar argument will show that. is normal. L1 


Some of the properties of ordinal addition and multiplication are familiar. 


Theorem 8.3.6. For all ordinals , ,and , 


GQ) 52) 5.08) 5.4 .6) _.© 


Proof. We shall prove only (3) and leave the rest for the exercises. Thus, we 
assume (1). Let. and __ be ordinals. We prove by transfinite induction that 


Let be an ordinal. Assume the induction hypothesis 


We must prove that 


(8.17) 
CaseE1: . Then — byrule (A1), and 
Therefore, 
CaseE2: foranordinal .We shall prove (8.17) as follows: 
Therefore, 


CaASsE3: — is a limit ordinal. We shall show that as follows: 


Therefore, (8.17) is true, and the proof is complete. U 


As and are normal class functions, the following order-preserving and 
left cancellation laws hold. 


Theorem 8.3.7. For all ordinals , ,and , 


(14) ,(@) if ,then ,(3) if ,then ,(4) if ,then implies 


We now define exponentiation on the ordinals. By Theorem 8.2.3, for each 
ordinal suchthat’ , there isaclass function’ satisfying 


(8.18) 


Definition 8.3.8. Let and be ordinals where _. Define to be the unique ordinal 


; that is, 


If , the following exponent rules for ordinal and limit ordinal __ hold: 


Lemma 8.3.9. Let be anordinal. If ,then is normal. 


Proof. See Exercise 11. UJ 


Theorem 8.3.10. For all ordinals) , ,and ,if  ,then 


(1) ,(2) ,(3) if ,then ,(4) 


Exercises 8.3 


1. Let be an ordinal and let be a formula that expresses (8.14). Let be 
an ordinal class function satisfying forall) . Prove, by transfinite induction, 
that, forall 

. Show that and that 

. Show that and). 

. Showthat) ,  , and 

. Show that 

. Let be an ordinal such that 


aw hwWhHN 


(a) Prove that forall, ,(b) Show that 
(c) Show that for each 
(d) Let and). Show that 


7. Prove that forall ordinals) ,if  , then 
8. Suppose .Proveforallordinals ,if then 


9. Prove Lemma 8.3.2. 

10. Prove Lemma 8.3.4. 

11. Prove Lemma 8.3.9. 
12. Forevery ordinal _, prove that there exists an ordinal such that 
13. Let be a nonzero ordinal. Prove that _, for all ordinals 


14, Prove Lemma 8.3.5(b). 
15. Prove Theorem 8.3.6(1). 
16. Prove Theorem 8.3.6(2). 
17. Prove Theorem 8.3.7(1). 
18. Prove the following: 
(a) Let and be ordinals. If} , then for some 


(b) Forall ordinals, , we have that is a limit ordinal. 

(c) Forallordinals) ,if) , then is a limit ordinal. 

(d) If is a limit ordinal, then for some nonzero ordinal 

(e) If is a successor ordinal, then for some ordinal and 
(f) There exists an ordinal such that 


19. Prove Theorem 8.3.10(1). 
20. Prove Theorem 8.3.10(2). 
21. (AC) Let be a countable ordinal. Prove that for all ordinals, if is countable, 


then) ,)| ,and are countable. 
22. Let be an ordinal. For each ordinal , let . Define the relation on as 


follows: For all and in | 


Prove the following: 


(a) is a well-ordered structure, for each ordinal 


(b) Forall ordinals) , the order type of equals 


23. Let be an ordinal. For each ordinal , let . Define the relation on 
as follows: Forall and in  , 


Prove the following: 


(a) is a well-ordered structure for every ordinal 
(b) For all ordinals) ,theordertypeof equals 


Exercise Notes: For Exercises 9-11, apply Theorem 8.2.9. Use Lemma 8.3.2 to 
prove Lemma 8.3.4, and then apply Lemma 8.3.4 to prove Lemma 8.3.9. For 
Exercise 18(d)(e), let. | be the least ordinal such that . For Exercise 21, use 
Corollary 5.2.10 together with transfinite induction. For Exercises 22—23, see 
Exercise 19 on page 76 and Exercise 6 on page 174; also, use transfinite 
induction for part (b). Exercises 22—23 offer another way of defining ordinal 
addition and multiplication. 


8.4 The Cumulative Hierarchy 


The cumulative hierarchy is a collection of sets that are indexed by the ordinal 
numbers. We will be defining the sets in this collection by ordinal recursion. The 
intent is to build up sets by starting with the empty set and then proceed to 
construct new sets using the power set operation in a step by step manner, where 
the ordinals act as the steps. 


Let be a formula that asserts the following definition by cases: 


Hence, for all , there exists a unique sothat . Theorem 8.2.4 implies that 
there exists an ordinal class function satisfying , forall . For each ordinal 
, we let, and thus, 


(8.19) 


The replacement axiom implies that is a set, for each. The class is 
called the cumulative hierarchy of sets. Clearly, (8.19) asserts that 


e 
’ 


e for any ordinal , 
e for any limit ordinal 


Theorem 8.4.1. For every ordinal ,wehave that isa transitive set and that 


Proof. We apply proof by transfinite induction. Let be an ordinal. Assume the 
induction hypothesis 


We must prove that _is a transitive set and that 
CASE1: . Then isa transitive set and 


CasE2: foranordinal .Let and .Since ,weseethat .As ,we 
conclude that’ . From (IH), because ,weseethat isa transitive set. So 
Hence, ; that is, . Thus, is a transitive set. As , it follows that 
Hence, ,because isa transitive set. Since _, the induction hypothesis (IH) 
now implies that 


CaseE3: is alimit ordinal. Thus, by (8.19). Therefore, .Let and .So 
forsome .By(IH),  isatransitive set.Since and_, we infer that 


So  ,andthus,  isatransitive set. LJ 


Theorem 8.4.2. For all ordinals. , we have that 


Proof. We will again apply proof by transfinite induction. Let _ be an ordinal. 
Assume the induction hypothesis 


We must prove that 


Case1: .Clearly ,as 


CaseE2: foranordinal .Hence, .So,( )  by(IH), since . Therefore, 


(i) as. ,by(_ )and Theorem 8.4.1; (ii) because = and 


Because _, items (i) and (ii) imply that 


CASE 3: is a limit ordinal. Thus, . Let  . By (IH), we have that 
Therefore, . Since isa limit ordinal, it follows that . By Theorem 8.4.1, 
.Hence, .Thus, . U 


The cumulative hierarchy of sets and Theorems 8.4.1—8.4.2 are illustrated in 
Figure 8.1. We now discuss some consequences of the regularity axiom. One of 
which (Theorem 8.4.5) will allow us to determine the complexity of any set. 


Figure 8.1. The cumulative hierarchy. 


Regularity Axiom. Every nonempty set — contains an element that is disjoint 
from 


The regularity axiom implies that forall . In mathematics, there are no 
sets that are members of themselves. For example, the set of real numbers _is 
not a real number. So. The regularity axiom, also known as the axiom of 
foundation, eliminates sets that are not relevant for standard mathematics. The 
next theorem identifies three basic consequences of regularity. 


Theorem 8.4.3. The axiom of regularity implies the following: 


(1) No set is amember of itself. 
(2) There arenosets| and  suchthat and 
(3) No function  withdomain _ satisfies for all 


Proof. We shall prove only item (3). Suppose, for a contradiction, that there 
exists afunction withdomain  satisfying( ) forall .Let .So_ is 
nonempty. The regularity axiom implies that there isaset suchthat  . Since 
, there isan suchthat .Clearly, . Moreover, because , we conclude 
from( )that .Hence, andtherefore  , whichis acontradiction. U 


Let betheclass ;thatis, ifandonlyif forsomeordinal . More 
formally, . The next lemma shows that if aset isa “subset” of _, then it is in 


Lemma 8.4.4. Let beaset. If forall , there is anordinal suchthat , 
then for some ordinal 


Proof. Assume that for all , there is an ordinal such that . Let be a 
formula that expresses the following definition by cases: 


(8.20) 


Hence, for all , there exists a unique sothat  . Thus, by our assumption, 


for all , there is a unique nonzero ordinal. suchthat  . By the replacement 
axiom there isaset suchthat .Let .Toshowthat ,let .Thus, for 
some .As_ , Theorem 8.4.1 implies that .Hence, and . LI 


Theorem 8.4.5. Every set is an element of the class 


Proof. Let beaset. By Theorem 6.2.8, let bea transitive setsothat .We 
will first show that 


(8.21) 


Suppose, to the contrary, that this is not the case. Therefore, the set 


is nonempty. By the regularity axiom, there isaset  suchthat .Hence,( ) 
. Since and__ isa transitive set, we infer that . Thus, (_) implies that 
forall ,thereis anordinal suchthat .So by Lemma 8.4.4, we must have 


that forsome ordinal . Therefore, , a contradiction. So (8.21) holds and 
Lemma 8.4.4 implies that for some ordinal . Since , we have that 
Hence, . U 


Theorem 8.4.5 allows us now identify as the universe of sets. Since every 
set belongs to. , we can now measure the complexity of any set, also called the 
rank of a set. The proof of Theorem 8.4.5 shows that for each set _, there is an 
ordinal _—_ such that 


Definition 8.4.6. The rank of aset| ,denoted by _, is the least ordinal such that 


Thus, if ,then . The rank of a set allows us to specify the particular stage 
of the cumulative hierarchy at which the set is first constructed. 


Theorem 8.4.7. The following two properties of the rank operation hold: 


(1) If , then 
(2) 


Proof. First, we prove item (1). Let .So .Let  .Hence, . Thus, for 
some (see Exercise 11). Therefore, .Since , we conclude that 


To prove (2), the replacement axiom implies that is a set, that is, a set of 
ordinals. Hence, by Theorem 8.1.15(2) and Definition 8.1.16, we conclude that 

is an ordinal. We will now show that and. First, observe that , for 
each . So by Exercise 15, for all . Thus, . Hence, . Now we show 
that . From item (1), we conclude that forall .Hence, forall (see 
Exercise 9(b) on page 185). Therefore, by Exercise 13 on page 186. Since 
and, Corollary 8.1.14(2) implies the equality . U 


Exercises 8.4 


1. Prove Theorem 8.4.3(1). 

2. Prove Theorem 8.4.3(2). 

3. Prove thatif  , then 

4. Prove thatif , then 

5. Prove, by induction, that is finite, for all 

6. Using Exercise 5 and Theorem 5.2.9, show that is countable. 
7. Suppose that' . Using Exercise 5, prove that is finite. 

8. Suppose that) isfiniteand| .Provethat| forsome 

9 


. Let be a limit ordinal. Prove that is an inductive set. 
10. Showthatif) , then 


11. Suppose that) . Show that for some 
12. Let be a limit ordinal. Let and. Prove that 
13. Let where is a limit ordinal. Prove that and prove that 
14. Prove that for all ordinals and ,if| ,then 


15. Let. beanordinal. Prove that if and only if 
16. Prove thatif) | then is the least ordinal such that 
17. Let be a set. Prove that 
18. Let be a set. Prove that 


19. Let be a set. Prove that 
20. Let and be sets and let 


(a) Show that 
(b) Show that 


21. Prove the following: 
(a) , whenever is an ordinal. 


(b) For every ordinal _, the equality holds. 


22. Let be the transitive closure of . Prove that 


23. Let be a set and let be a formula. Suppose that 


Prove that there exists a set such that 


Exercise Notes: For Exercise 1, let be aset, and then consider the set . In 
Exercise 6, __ is said to be the set of all hereditarily finite sets. For Exercise 8, 
use Exercise 6 on page 116. For Exercise 22, review both Definition 6.2.9 and 
Exercise 6 on page 155. 


9 
Cardinals 


In Chapter 5, we showed that the natural numbers allow us to measure the size of 
every finite set; that is, for any finite 4 , we defined |A| = n so that the natural 
number 7 identifies the number of elements in 4 . Theorem 5.1.7 showed that 
two finite sets have the same number of elements if and only if there exists a 
one-to-one correspondence between the two sets. On the other hand, we were not 
able to define the “number” |A| when , is an infinite set; however, we did 


define the expression |A| =, |B| to mean that “4 has the same cardinality as 
B.” In other words, |A| =. |B| means that “there is a one-to-one function 
f: A— B that is onto B .” This was done because we did not have “infinite 
numbers.” Now that we have the ordinals, we can define | A] to be a particular 


ordinal whenever A can be well-ordered. We will then be able to show that the 
ordinals |A| and || are equal if and only if there is a bijection f: A—- B. 


9.1 Cardinal Numbers 


His discovery of uncountable sets led Cantor to develop the cardinal numbers, 
which John von Neumann later refined via the following definition. 


Definition 9.1.1. A cardinal number, or cardinal, is an ordinal AK such that for every 


3 € ¥& there is no one-to-one function f “Ko 6. 


Theorem 9.1.2. Every natural number is a cardinal. The ordinal w is a cardinal. 
Every infinite cardinal is a limit ordinal. 


Proof. Let nm € w. Corollary 5.1.9 states that for every m € n, there does not 
exist a one-to-one function f:m-—+m. Thus, every natural number is a 
cardinal. Theorem 5.1.10 implies that for every nm € w, there is no one-to-one 
function f:w-—+n. Therefore, w is also a cardinal. Let & be an infinite 
cardinal. Thus, w € &. We will now prove that & is a limit ordinal. Suppose, 
for a contradiction, that & is a successor ordinal. Hence, « — q* for some 
ordinal @.So a € «. Consider the function f: « — a@ defined by 


0, if€=a; 
F(§) e*, US ew; 
€é, ifweé. 


Clearly f is one-to-one, and thus, & is not a cardinal. LJ 


Theorem 9.1.2 shows that ,,,+ is not a cardinal. We now show that an ordinal 
@ is a cardinal if and only if |a| 4, |y| , forall y € a. 


Theorem 9.1.3. Let a be an ordinal number. Then q@ is a cardinal if and only if 
for every Y € Q, there is no bijection f: a+ 7. 


Proof. Let a be an ordinal. Assume that @ is a cardinal. Let y € @. As there 
is no one-to-one function g:@—7, there is no bijection f:a—-y. 
Conversely, assume that (a) for every y € @, there is no bijection f: a —- +. 
Suppose, for a contradiction, that @ is not a cardinal. Hence, for some y € a, 
there exists a function f: @ — 7 that is one-to-one. Let C = ran(f). Since 
C Cy, if follows that (C,€,.) is a well-ordered structure (see Exercise 5 on 
page 145). Let @ be the order type of (C,€,,). Thus, by Exercise 8 on 
page 185, we have that 8 € +. Hence, 8 € a. By Theorem 8.1.4, H: C + 8 
is a bijection. Because f:a—C is a bijection, (Hof):a—-+ 8 is a 
bijection, which contradicts (a). LJ 


Theorem 9.1.4. Let 4 bea set. Then the following are equivalent: 


(1) A has a well-ordering. 
(2) There is a bijection f: a — A for some ordinal a. 
(3) There is a one-to-one function f: A — a for some ordinal a. 


Proof. We shall show that (1) = (2) > (3) > (1). 


(1) = (2): Assume 4 has a well-ordering. Theorem 8.1.4 and its proof show 
that there is an ordinal § and a bijection H}: A — 6. Theorem 3.3.18 therefore 
implies that 7-1. § —s A isa bijection. 


(2) => (3): Suppose that f: a — A is a bijection for some ordinal a. Then 
the inverse function f~!:; A —+ q@ is one-to-one by Theorem 3.3.18. 


(3) = (1): Assume that there is an injection f: A — a for some ordinal a. 
Exercise 6 shows that 4 has a well-ordering. L] 


The next definition is a straightforward generalization of Definition 5.1.2. 


Definition 9.1.5. If the set A has a well-ordering, then | A] is the least ordinal A for 


which there is a one-to-one function f : A —K. 


Thus, if & is a cardinal, then |K| = & (see Exercise 2). 


Lemma 9.1.6. Suppose that A has a well-ordering, and let « = |A|.Then k is 
a cardinal, and there is a bijection HH: A+. 


Proof. Let A be a set with a well-ordering and let « = |A| . Exercise 9 implies 
that & is a cardinal. Since « = |A|, there exists a function f: A — « that is 
one-to-one. Define the relation = on 4 by x = y iff f(a) € f(y), forall x 
and y in A. Exercise 6 shows that (A, <) is a well-ordered structure. Let ) 
be the order type of (A, <). Exercise 7 on page 185 and Lemma 8.1.11 imply 
that A € «&. Theorem 8.1.10 implies that there is a bijection H: A+). 
Because /{] is one-to-one, AGA, and |A/=x«, it follows from 
Definition 9.1.5 that \ — « . Therefore, }{/: A — « is the desired bijection. LJ 


Lemma 9.1.7. Let 4 and Pf be well-ordered sets. Then 


(1) |A|<. |B] iff |AJEe |B], 2) |Al=_|B| iff |A]J=|B], ©) 
|A] <. |B| iff |A] € |B]. 


Proof. See Exercise 11. LJ 


Let C be the collection C = {K: k is a cardinal} .Is C a set, oris C a 
proper class? Friedrich Hartogs, in 1915, proved the following theorem from ZF 
alone (that is, without using the axiom of choice). This theorem implies that © 
is a proper class (see Exercise 4). 


Hartogs’ Theorem 9.1.8. For every set ¥ , there is a cardinal « so that there is 
no one-to-one function f: Kk + X. 


Proof. We begin by defining the set |’ (via Theorem 2.1.3) to be 
W = {(A,<): AC X and (A, =) is a well-ordered structure}. 


By Theorem 8.1.4, for each (A, <) € W, there exists a unique ordinal § that is 
the order type of (A, <) (see Definition 8.1.9). Thus, the replacement axiom 
implies that the following collection is a set: 


kK = {6:6 is the order type of some (A, <) € W}. 
(9.1) 


We will show that « is the desired cardinal by means of the following three 
claims. The proof of the first claim shows that « is an ordinal, that is, K isa 
transitive set and it is well-ordered by the relation € . 


Claim 1. « is an ordinal. 


Proof. We apply Definition 8.1.1. To show that x is a transitive set, let ~ € 6 
and 6 € «. As 6 € x, there is a well-ordered structure (A, <) € W that has 
order type §. By Theorem 8.1.10, there is a bijection 7: A -—+6. Let 
B=H~'ly|. Thus, BC AC X. Since H is a bijection, it follows that 
(H | B): B + 7 is also a bijection. Exercise 6 implies that there is a well- 
ordering on #8 of order type 7. Hence, y € &. Thus, « is a transitive set. 
Theorem 8.1.15(1) implies that K is well-ordered by €, as K is a set of 
ordinals. So & is an ordinal. (Claim 1) OU 


Claim 2. There is no one-to-one function f: kK + X. 


Proof. Suppose that f: « —+ X is one-to-one. Let A = ran(f). Thus, A C X 
and f: « + A isa bijection. So f~': A — x is also a bijection. Exercise 6 
shows that there is a well-ordering < on A such that # is the order type of 
(A,<). Thus, (A,<)€W and KER, contradicting Lemma 8.1.3(2). 
(Claim 2) UO 


Claim 3. « is a cardinal. 


Proof. Assume, to the contrary, that there is an injection g: * — @ for some 


a@€x#. Since a € kK, (9.1) implies there is a well-ordered structure (A, <) 
where (A,<) has order type a@ and AC X. So there is a bijection 
h:a—-+A- Hence, (hog): k—+ X_ is one-to-one, which contradicts 
Claim 2.(Claim3) OU 


Thus, & is a cardinal, and there is no injection f: « — X (Theorem) U 


Corollary 9.1.9. For every ordinal 7, there is acardinal K suchthaty€ rk. 


Proof. Let 7 be an ordinal. Theorem 9.1.8 implies that there exists a cardinal kK 
such that there is no one-to-one function f: « —- y. Thus, y€«.U 


Corollary 9.1.9 implies that there is a cardinal & such that w € x. It follows 
that K is uncountable. To see this, observe that if K were countable, then there 
would be a function f{: & — w that is one-to-one, and this would imply that « 
is not a cardinal. Hence, & is uncountable. 


Theorem 9.1.10. Let © bea set of cardinals. Then 


(1) © isa cardinal. 
(2) Letxk =supC. If C isa limit set, then 6 € « forall dec. 


Proof. See Exercise 5. LJ 


The aleph number system is used to identify the infinite cardinal numbers. The 
system is defined by transfinite recursion. As w is the first infinite cardinal, w 
will be the starting value of this recursion. We also need to define a “next 
cardinal” function. By Corollary 9.1.9, we know that for each ordinal @ , there is 
a cardinal that is “larger” than a . Let S: On —+ On be defined by 


S(a) = the least cardinal « such that a € kK. 


Clearly, § is a class function. Consider the following definition by cases: 


o, if f is the empty function; 
S(f(y)), if f: y* — On for some ordinal y? 
sup f[B], iff: 6 — On for some limit ordinal B; 
S, if none of the above hold. 
(9.2) 


Let y(f,y) be a formula that expresses (9.2). Hence, by Corollary 9.1.9, for 
every set f, there exists a unique Y so that w{f,y). Thus, Theorem 8.2.4 


implies that there exists a class function %- On — On such that? 


W, if B = 0; 
X(8) S(N(y)), if 8 =-y* for some ordinal 7; 


sup X[8], if 8 is a limit ordinal. 
(9.3) 


Note that S(N(y)) = X(y + 1). Corollary 9.1.9 and Theorem 9.1.10 imply that 
X(3) is an infinite cardinal for every ordinal 8. In addition, for each infinite 
cardinal x , there is an ordinal a such that X{(a@) = «& (see Theorem 9.1.11). 


Exercise 15 and (9.3) imply that &: On — On is normal. For each ordinal 3 
, we now let X3 = X(3). With this slight change in notation, (9.3) becomes 


No, if fp =0; 
Ng = 4 Nya, if 8 = y* for some ordinal 7; 
sup{X% :a€ GB}, if @ is a limit ordinal. 
(9.4) 


The recursive definition (9.4) allows us to use the ordinals to identify all of the 
infinite cardinal numbers, in_ increasing order, as __ follows: 
mem Come Ny 6 hp 6 Oe ye Meee 


It follows that % = w. So, Np is the first infinite cardinal. N) is often referred 
to as aleph-naught. Since w € &,, we see that &; (aleph-one) is the smallest 
uncountable cardinal. 


Our next theorem verifies that the class function &: On — On is increasing 
and “enumerates” all of the cardinals. 


Theorem 9.1.11. For all ordinals @ and 8, if a€ 8, then NR ER. 
Moreover, for every infinite cardinal «, there is an ordinal @ such that % =k. 


Proof. See Exercise 15 and Exercise 20. LJ 


Let Y be a limit ordinal. Recalling Definition 8.2.11, a set C C + is said to 
be cofinal in y if for all a € y, thereisa 8 E€ C suchthatae€ 8. 


Definition 9.1.12. Let Y be a limit ordinal. Then the cofinality of “Y , denoted by cf(4 ), 


is the least ordinal G@ © ‘y for which there exists a function Fa > @ —> 4y such that 


flal is cofinal in ‘Y. 


Theorem 9.1.13. Let ¥ bea limit ordinal. Then cf(7) is a cardinal. 


Proof. See Exercise 13. LJ 


Since every infinite cardinal « is a limit ordinal, the cofinality of & is 
defined. 


Definition 9.1.14. An infinite cardinal AK is said to be regular if cf(K) kK. If 


cf( K) € «,, then K is said to be singular. 


Thus, an infinite cardinal & is regular if and only if whenever f: @ — « and 
a@ €«, then there is y € & such that fla] Cy. The cardinal \, is regular 
because for every m € w and each function f: mn — w, the set f|n| is finite 
(see Exercise 5 on page 116). Hence, f|n| cannot be cofinal in w. Moreover, 
since w is a limit ordinal, we see that %, = sup{X®, :m € w} by (9.4). So X&, 
is a singular cardinal as the function f: w — &, defined by f(m) = &, is such 
that f{w] is cofinal in &,. 


Theorem 9.1.15. Let 7 be a limit ordinal. Then cf(-y) is a regular cardinal. 


Proof. See Exercise 14. LJ 


Theorem 9.1.16 (AC). The cardinal &, is regular. 


Proof. Suppose, to the contrary, that & is not regular. Thus, there is an ordinal 
a@ € X anda function f: a — X such that fla] is cofinal in X,. Exercise 21 
implies that @ is countable and that f(7) is countable for all 7) € a. Hence, 
L{f(m) : 7 € a} is countable by Corollary 5.2.10. Furthermore, Exercise 22 
implies that (_){ f(7) : 7 € a} = &,. Thus, X, is countable, a contradiction. LJ 


In the next section, we will extend Theorem 9.1.16 by showing that %,4 is a 
regular cardinal, for each ordinal a. 


For each ordinal y , we know that the relation € yields a well-ordering on 7. 
In our next lemma, we will prove that there also exists a well-ordering on the set 
y x y. We first define the relation <4 on y x 7, which orders the ordinal pairs 
in y x 74, first by maximum, then by first component, and then by second 


component. Recall that max(a, 3) is the larger of the two ordinals @ and 8. 


(a, 6) Sy (d, Y) if and only if 
max(a, 8) € max(6d, 7) 


Definition 9.1.17. Let ‘Y be an ordinal. The relation =-» on ~y X ‘y is defined by 


V [max(a, 8) = max(d,y) Aa € 4] 


V [max(a, 8) = max(d,y)Aa=dA8€)|]. 


Lemma 9.1.18. For each ordinal 7, the relation <+ is a well-ordering on 
yx. 


Proof. Let 7 be an ordinal. It is straightforward to show that =. is a total order 
on y x ¥y (see Exercise 24). So we will show only that =. is a well-ordering. 
Let ACyxvy be a nonempty set. Let M = {max(a, 3): (a, 8) € A}. 
Clearly, \/ is a nonempty set of ordinals, and so it has an € -least element //. 
Now let & = {@ € 7: (48 € 7)((a, 8) € AA max(a, 8) = p)}. 


Thus, /K is another nonempty set of ordinals with an € -least element 1. 
Finally, lett S={8€7y: (v,8) € AA max(y, 8) = pt}. Since § is also 
nonempty, it has an € -least element 7). One can now show that (v,7) € A 
and (v,n) <5 (a,8) forall (a, 8) € A. Therefore, <+ is a well-ordering on 
yx7.U 


The following result is fairly easy to verify (see Exercise 25). 


Proposition 9.1.19. Let y be an_ ordinal. For each ae€y, 
axa={(E,n) ex 7: (€,7) <y (0, a)}. 


A set C Cy x 4 is said to be cofinal in y x y if for all (a,8)E yxy, 
there isa (n,€) € C such that (a, 8) <4 (7, €) . 


Lemma 9.1.20. Let 7 be an ordinal and § be the order type of <4. Then 
+ € 6, and there is a function f: yy — yxy such that fly] is cofinal in 
pee 


Proof. Let 7 be an ordinal and f: y~ — y x y be defined by f(a) = (a,a). 
It thus follows that a € 8 if and only if f(a) <, f(@), forall a@ and 8 in y. 
Thus, by Lemma 8.1.11 and Exercise 7 on page 185, we have 7 € 6 where § is 
the order type of <+. Let (n,€)€ yx vy and let a = max(n,&). Thus, 
(n, €) <+ (a, a) , and therefore, f|-y]| is cofinal in y x y.O 


So for each ordinal 7 , we now know that the set yy x 7y has a well-ordering. 
Lemma 9.1.6 therefore implies that the cardinal | x | is defined. When 7 is 
an infinite ordinal, one might think that the cardinal |y x | would be larger 
than the cardinal |-+| . In fact, it is not, as we will now establish. 


Theorem 9.1.21. Let Y be an infinite ordinal. Then |y x y| = |9| . 


Proof. Suppose, for a contradiction, that there exists an infinite ordinal y such 
that (a) |y x y| # |y| . By Theorem 8.1.21, we shall assume (without loss of 
generality) that ‘Y is the least such ordinal. Thus, 


(Va € y)(wEa— lax al = fal). 
(9.5) 


Exercise 9 on page 116 implies that |w x w| = |w|. Hence, w € 7. We now 
consider two cases: either 7 is not a cardinal, or it is a cardinal. 


CASE 1: Y is not a cardinal. So Theorem 9.1.3 implies that there is a bijection 
f:y-— a for some ordinal a € y. Since ¥ is infinite, a is also infinite. 
Thus, by (9.5), we have that |a x a| = ja] . Using f, one can easily define a 
bijection g:yxXy—-axXxa and conclude (via Lemma 9.1.7(2)) that 
ly x y| = |y|. which contradicts (a). 


/ 


CASE 2: is a cardinal. Because Y is an ordinal, Lemma 9.1.18 implies that 
(y x y, <5) is a well-ordered structure. Let § be the order type of this 
structure, and let H: y x 7 — 6 be the isomorphism from (7 x yy, <.,) onto 
(6,€) (see Remark 8.1.8). Lemma 9.1.20 states that y€d. As H is a 
bijection, (a) implies that + € 6 . Hence, there exists 7) € y and Y € ¥ so that 
H((n,v)) =. Let a = max(n,v) +1. So, as ¥ is a limit ordinal, a € y. 
Since (7,v) <5 (O,a@), Remark 8.1.8 and Proposition 9.1.19 imply that the 


, 
restricted function }]/~! | + satisfies (Ho Ty) yrax a. 


Because /{~! | + is one-to-one and 7 is infinite, it follows that a is infinite. 
So Ja x a| = |a| by (9.5). Since HH~! | + is one-to-one and |a x al = jal, 
there is an injection f: y - a. As @ € 4, we infer that 7 is not a cardinal. U 


Corollary 9.1.22. If « is an infinite cardinal, then |k x K| =k. 


Corollary 9.1.23. Suppose that @ and § are nonzero ordinals. If either @ or 8 
is infinite, then |a x 8| = max(|a}, |3]). 


Proof. Suppose that at least one of the nonzero ordinals @ and § is infinite. We 
shall assume, without loss of generality, that a € 3. Thus, @ is infinite, and as 


Since 3 x 8 has a well-ordering, every subset of 3 x @ also has a well- 
ordering. Clearly, |{0} x 8| = |8| and |8 x 8| = |8|.So jax 6] =|6|.0 


Corollary 9.1.24. Let «& and 4. be nonzero cardinals. Then 
kK x 6| = max(k, 6) if at least one of the cardinals is infinite. 


Exercises 9.1 


1. Let @ bean ordinal. Show that |a| € a. 

*2. Let K bea cardinal. Prove that || fe. 

3. Prove that Just | — oF I 

“4. Prove that the class {Kk -K 1S a cardinal} is not a set. 

*5. Prove Theorem 9.1.10. 

*6. Let A beaset and @ be an ordinal. Suppose that f : A — @ is one-to- 
one. Define a relation =< on 4 by x & y if and only if f(x) € fly): 
for all @ and Y in A. Prove that =< is a well-ordering on A. If 
f -A Sa is a bijection, then prove that @ is the order type of (A, <). 

7. Let @ bean ordinal. Suppose g: GQ —> X isa function. Let 4 C a. Prove 
that g| A] has a well-ordering and that | g|A]| &- | A| ; 

*8. Assume that 4 has a well-ordering and let B ia A. Prove that 
| Bl e | A| using Definition 9.1.5. 

*9. Prove that if A. has a well-ordering and # | A| , then & is a cardinal. 

10. Let & bea cardinal. Prove that if 3 : A — K isa bijection, then & | A| 


“11. Prove Lemma 9.1.7. 

12. Show that the collection |{/ in the proof of Hartogs’ Theorem 9.1.8 is a set. 
* 13. Prove Theorem 9.1.13. 

* 14. Prove Theorem 9.1.15. 

15. Prove that ¥- Oy — On is strictly increasing. 


16. Let Q@ bea limit ordinal. Prove that cf(Ny) cf(a). 

17. Prove that for all ordinals (9 , we have that 9 € Ne. 

18. Prove for every ordinal @ there is an ordinal {3 such that @ = 8 and 
No = 6. 

19. Prove there is an ordinal € such that ci(é ) = w and Ne =€. 

* 20. Let K bean infinite cardinal. Prove that uy kK. for some ordinal @ . 

“21. Let @ bean ordinal so that Ny EQaE XN, . Prove that Q is countable. 

*22. Let @ € Y where Q@ is an ordinal and ‘Y is a limit ordinal. Let 
f >: @—> vy bes such that f [a is cofinal in ‘Y. Prove that 


U{f(€) :Eea}=y7. 
23. Assume that for any two sets \ and Y, either |X | <a Y 


ly | ae |X | . Now, using Theorem 9.1.8, prove that every set can be well- 


or 


ordered. 
“24. Let ‘Y be an ordinal. Show that -y isa total orderon y X 4y. 


* 25. Prove Proposition 9.1.19. 


Exercise Notes: For Exercise 5, recall that © U (’ . Theorem 8.1.15(2) implies that 


U C’ is an ordinal. For Exercise 6, review Exercise 12 on page 76, Theorem 8.1.10, and 


Lemma 8.1.11. For Exercise 15, apply Theorem 8.2.9. For Exercise 17, use Theorem 8.1.22, 
and Exercise 15 in the limit case. For Exercise 19, read the proof of Theorem 8.2.19. For 
Exercise 20, by Exercise 17, there is a least ordinal such that & 4 Na . For 


Exercise 21, use Exercises 1 and Lemma 9.1.7(2) to show that la No. For 


Exercise 23, use Theorems 9.1.8 and 9.1.4. Exercise 23 shows that the Comparability 
Theorem 7.1.8 implies the axiom of choice. 


9.2 Cardinal Arithmetic 


In Section 8.3, we defined three arithmetic operations on the ordinals; that is, we 
defined addition, multiplication, and exponentiation on the ordinals. So if @ and 
8 are ordinals, then a+ 8, a@-8, and ? (where 2 € a ) are also ordinals. 
For example, w is an ordinal, and hence, w-+w and w-w are ordinals; 
however, w + w and w-w are not cardinals. 


In this section, we will define arithmetic operations on the cardinals in such a 
way that when we add or multiply two cardinals, the result will be a cardinal. 
Thus, ordinal arithmetic and cardinal arithmetic will be different operations. For 
example, cardinal addition is commutative (see Lemma 9.2.2), whereas ordinal 
addition is not commutative (see page 196). 


For a well-ordered set A , recall that in Definition 9.1.5 we identified a unique 
cardinal number «& so that |A| = « . In addition, Lemma 9.1.6 shows that there 


also exists a bijection 7: A + k. 


If § and « are cardinals, the proof of Corollary 9.1.23 allows us to conclude 
that the two sets ({0} x 6) U({1} x &) and 6 x & have well-orderings. Thus, 
we can make the following definition. 


Definition 9.2.1. Let § and K be cardinals. Then the cardinal sum | «& and the 
cardinal product 4 . « are defined, respectively, to be 


) d+K =|({0O} x dJ)U({1} x k)|.@ 6-K=|d xk]. 


In this section, the Greek letters 6,@,«, A will used only to denote cardinals. 
When using these particular Greek letters, the operations 6 + « and §., are to 
be viewed as the cardinal operations given in Definition 9.2.1. The other Greek 
letters will be used to represent ordinals that may (or may not) be cardinals. In 
our next lemma, we show that cardinal addition and multiplication satisfy some 
very familiar properties. 


Lemma 9.2.2. Let 6, 9, and & be cardinals. Then 


(1) 6+K=K+4+6, (2) 6-K=K-6, (0) 0-(64+4) =0-6490-k, (4 
6+ (d+) = (04+ 6)+k,(5) 0-(6-K) = (0-6) -k. 


Proof. Let 6, @, and « be cardinals. We first prove (1). Since one can easily 
define a bijection 


f: ({0} x 6) U({1} x x) + ({0} x K) U({1} x 4), 


Definition 9.2.1 and Lemma 9.1.7(2) imply that 6 + « = K + 6. To prove (2), 
Exercise 17 on page 139 shows that |é x K| =, |K x 6| . Thus, 6-« = «-6 by 
Lemma 9.1.7(2). Items (3), (4), and (5) follow in a similar manner. L] 


Remark 9.2.3. A natural number is also a cardinal, that is, it is a finite cardinal. 
The arithmetic operations defined in Definition 9.2.1, when applied to natural 
numbers, coincide with the operations of addition and multiplication presented 
in Chapter 4. For natural numbers m and 7, let m-+ 7 be the cardinal sum 
given in Definition 9.2.1, and let ™ +-,, n be the natural number resulting from 
the notion of addition identified in Theorem 4.3.3. Let m € w. One can prove 
by mathematical induction that 


|({O} x m) U ({1} x n)| =. |m +, n| 


for every natural number n (using Theorem 5.4.6(1) in the inductive step). 
Lemma 9.1.7 thus implies that |({9} * ™)U ({1} x m)| = |m+y nf. 


Hence, m+n=|m-+,n|. Theorem 5.1.8 thus implies that 


m+n =m -+,, n.Ina similar fashion, one can prove that the cardinal product 
of two natural numbers also agrees with the notion of multiplication given in 
Chapter 4. 


The following lemma confirms that the operations of cardinal addition and 
multiplication have the expected order-preserving properties. 


Lemma 9.2.4. Let 6,0,«,\ be cardinals so that 6 € @ and k € \. Then 


(1) 64+ 4K €04+A,(2) 6-KEO-X. 


Proof. Let 6,0,«, be cardinals. Assume that 6 € @ and k € A. Thus, 6 C @ 


and rCA, as cardinals are also ordinals. Hence, 
({O} x 6) U({1} x x) C ({0} x OJ) U({1} x A), 
Cx<K EPS A. 


Thus, by Exercise 8 on page 217,6+K €0+A andéd-KEO-A.U 


Our next lemma, called the absorption law of cardinal arithmetic, shows 
that the addition and multiplication of two cardinals is quite easy to evaluate, 
whenever one of the cardinals is infinite. 


Lemma 9.2.5. Suppose that § and « are nonzero cardinals. If either § or & is 
infinite, then 6 + K = 6- = max(d,k). 


Proof. Suppose that at least one of the nonzero cardinals § and x is infinite. 
Corollary 9.1.24 thus implies that « - 6 = max(K, 6). We will now prove that 
kK + 6 = max(k, 6). Assume, without loss of generality, that 6 € « . Therefore, 


Kis infinite and, as OER and leu, we have 
CLE eae, IOP MTL eG ee as, 


Clearly, |{1} x «| =« and |x x x| = «. Hence, 6+K=«.U 


As was noted in Remark 9.2.3, the sum and product of two finite cardinals 
agree with the sum and product of two natural numbers as defined in Chapter 4. 
Lemma 9.2.5 shows that the sum and product of two infinite cardinals equals the 
maximum of the two cardinals. Cardinal exponentiation, when applied to infinite 
cardinals, is not as easy to evaluate. 


Cardinal Exponentiation 


Given an infinite set 4 , recall that the cardinal |A| is defined if and only if the 
set A has a well-ordering (see Theorem 9.1.4 and Definition 9.1.5). For this 
reason, we shall be assuming the axiom of choice whenever cardinal 
exponentiation is involved. The axiom of choice, via the Well-Ordering 
Theorem 7.3.1, implies that every set can be well-ordered. In particular, when § 
and « are cardinals, the set 4,. can be well-ordered, where 4. is the set of all 
functions from § tok. 


Definition 9.2.6 (AC). Let § and & be cardinals. Then the cardinal power 6 


is defined to be the cardinal |? | . 


Note that 9° — 1 and 0“ = 0 when & ¥ O (see Remark 3.3.7). 


Lemma 9.2.7 (AC). Let « be a cardinal. Then 2" = |P(x)|. 


Proof. Recall that 2 = {0,1}. Theorem 5.4.3 thus asserts that 
Lemma 9.1.7(2) now implies that 2" = |P(«)|.O 


"2| =c [P(k)] - 


Recall that Cantor’s Theorem 5.4.13 asserts that |A| <, |P(A)| for any set 
A . This theorem can now be restated in terms of cardinal exponentiation. 


Theorem 9.2.8 (AC). Let & be acardinal. Then «~ € 2". 


Proof. Let & be a cardinal, Theorem 5.4.13 states that |«| <, |P(«)|. Thus, 
l«| € |P(K)| by Lemma 9.1.7(3). Now Lemma 9.2.7 implies that « € 2”. 0 


The next theorem identifies three basic facts about cardinal exponentiation. 


Lemma 9.2.9 (AC). Let §, @, and & be cardinals. Then 


(1) Kote Ko ‘ K?, (2) (K . 6)? Ko 69, (3) (n°)? KOO 


Proof. Let 56, @, «& be cardinals. For (1), we first show that 
[re] P °K x Axl. Let A= {0} x 6 and B= {1} x @, which are clearly 


|A| =. [6], 


|B| =c |@|, 
|AU B| =, |6 + 4], 

[°«| =e | 

ia ale’. 


disjoint. Lemma 9.1.6 implies that 


Theorem 5.4.7(1) asserts that |4""«| =, |4« x "|. Therefore, Theorem 5.4.6 
now implies that + | c °K x «| c |x? x Ko . Thus, ,9+9 — po. by 
Lemma 9.1.7(2). A similar argument will establish items (2) and (3). LJ 


In our next lemma, it shall be shown that cardinal exponentiation possesses an 
order-preserving property that closely corresponds to a property identified in 
Theorem 5.4.16. 


Lemma 9.2.10 (AC). Let 6,@,A, and & be cardinals. If 6€ @ and XE, 
then \° € K? when 6 4 0. 


Proof. Let 6,0,,« be cardinals and assume that 6 € @ and \ € x. Clearly, 
|5| <- |@| and |A| <. |x|. If 6 A 0, then |*A| <, |%| by Theorem 5.4.16(3). 
Therefore, we have that \° € «? by Lemma 9.1.7(1). 0 


As an application of Lemmas 9.2.9 and 9.2.10, we have the following result. 


Lemma 9.2.11 (AC). Let & be an infinite cardinal. Then «* = 2". 


Proof. Let « be an infinite cardinal. Lemma 9.2.7 yields the inequality K € 2”. 
Thus, «eG (2")* by Lemma 210: Therefore, 


K* c (a")" pial a Er", 


where the last equality follows from Lemma 9.2.5. Hence, «* = 2". U1 


Theorem 9.2.12 (AC). Let 4 be a set and let & be a cardinal. If |X| € « for 
all X € A, then |) A] € |A|-&. 


Proof. Let 4 be a set and «& be a cardinal. Suppose |X| €«, for each 
X €A.Let 6 =|A| andlet ph: § + A bea bijection. Since |h(a)| € & for 
all @ € 6, the axiom of choice implies that there is a one-to-one function 
fa: h(a) +k, for each q@ € 6. Define the function g: [JA +dxkK by 
g(x) = (a, fa(x)) where a is the least a € 6 such that x € h(a) 


for each x € |).A . We shall prove that the above function g is one-to-one. Let 
x and y be in |)A and assume that g(x) = g(y). Let a € 6 be the least 
such that « € h(a), and let G@ € 6 be the least such that y € h(8). Since 
g(x) =g(y), we have ({a,fa(x)) = (8,fa(y)). So a= 8 and 
fa(x) = fa(y). Therefore, f.(x) = fa(y) andso x = y, since f, is one-to- 
one. Hence, g is one-to-one. By Lemma 9.1.7(1), || A| € |é x «| ; that is, 
IU A] € 6-«-.So [YA] E]A]-«.0 


Theorem 9.2.13 (AC). For each ordinal a, the cardinal &,..; is regular. 


Proof. See Exercise 11. LJ 


Lemma 9.2.14 (AC). |R| = 2. 


Proof. Theorem 5.4.20 asserts that |IR| =. |P(w)| . Lemma 9.1.7(2) therefore 


implies that |IR| = |P(N))|, as w = Ny. Hence, |IR| = 2 by Lemma 9.2.7. 0 


We will now show that the continuum hypothesis has a different formulation 
when one assumes the axiom of choice. Recall that Cantor’s hypothesis (see 
page 133) asserts that any set of real numbers A _ is either countable or has the 
same cardinality as the entire set R . Also note that %, € 2 by Theorem 9.2.8. 


Continuum Hypothesis. There is no set A C R such that |w| <, |A| <, |R| . 


Lemma 9.2.15 (AC). There exists a cardinal 6 so that % € 6 € 2™ if and only 
if there exists an A C R such that |w| <, |A| <, |R|. 


Proof. Let § be a cardinal such that %, € 6 € 2%. Hence, \% € 6 € |IR|_ by 
Lemma 9.2.14. Thus, there is a one-to-one function f: 6 + R. Let A = f{d}. 
Hence, |w| € |A| € |IR|. Therefore, |w| <. |A| <. |IR| by Lemma 9.1.7(3). 
The converse holds similarly. OJ 


Since X; is the first cardinal number greater than Ny, Lemma 9.2.15 implies 
that the continuum hypothesis is equivalent to the equality 2% — yy). 


Continuum Hypothesis (AC). 2 — y,. 


In a similar manner, assuming the axiom of choice, the GCH (see page 137) 
can also be formulated in terms of a cardinal identity. 


Generalized Continuum Hypothesis (AC). 2= = ,,, for every ordinal a. 


We will present a theorem due to Kénig that can be viewed as an extension of 
Cantor’s Theorem 9.2.8. First, we establish two lemmas. 


Lemma 9.2.16 (AC). Let @ be an infinite cardinal such that cf(@) € k € @. If 
|F| =, then there is a set {Ac :€ €K} such that F=U,.,, Ag and 
|Ac| € @ forall € Ek. 


Proof. Let g be an infinite cardinal so that cf(?) EK € @. Assume |F| = 0. 
Let h: @ + F bea bijection and f: K — @ be such that f|x] is cofinal in 9. 
For each € Ek, let Ae = h[f(€)]. Thus, Ae C F and |Ae| = |f(6)] € A. 
Clearly, Uec, Ae G F . To show that F C Uee, Ag, let x € F. Since h isa 
bijection, there is an q € @ such that 2 = h(a). Because f{K]| is cofinal in ¢ , 
there is an € € K such that a € f(€). Hence, x € Ag andso x € Uee, Ag. O 


The proof of our next lemma employs a diagonal argument. 


Lemma 9.2.17 (AC). Let & be a cardinal and let § bea set. If {Ag :€ € K} 
is such that Ae C"S and {g(€):ge€Ac}AS for all €€x, then 
a 7 Us. K Ag . 


Proof. Let « be a cardinal and let § bea set. Suppose that {A¢ :€ € K} is as 
stated in the lemma. Therefore, S \ {g(€): g € Ae} is nonempty for all € € 
. By the axiom of choice there is a function f: « —+ S such that 


f(6)€ S\ {g(E) : 9g € Ac}, for each € € k. 
(9.6) 


So f €“S. We will show that f ¢ een Age. Suppose to the contrary that 
f € Ae for some € € x. Then f(€) € {g(€) : g € Ac}, contradicting (9.6). 


K6nig’s Theorem 9.2.18 (AC). Let « be an infinite cardinal. Then & € cf(2"). 


Proof. Let & be an infinite cardinal. Suppose, to the contrary, that cf(2”) € kK. 
Recall that « € 2", by Theorem 9.2.8. Thus, cf(2") € kK € 2". Let S = 2". 
So [S| = 2". Moreover, l*S 
nS *(2*)) Eig : Mal Or 


= because 


Since |"S| = 2” and cf(2") € & € 2", Lemma 9.2.16 implies that there exists 
an indexed set {Age :€ € K} so that “S=U-.,, Ae and |A¢| € 2” for all 


As |S|=2*, we have that {g(€):g¢€Ac}#S. Therefore, by 
Lemma 9.2.17 "S # Uec, Ag, a contradiction. 0 


Corollary 9.2.19 (AC). 2% 4 X,. 


Proof. By Theorem 9.2.18, X%) € cf(2). So 2% 4,, as cf(X) =N%-O 


Exercises 9.2 


1. (AC) Let 6,0, A, & be infinite cardinals so that § € 9 € \ € K. Simplify 
the cardinals (x? ' “\s and KR R2TR K 

. Prove Lemma 9.2.2(3). 

. Prove Lemma 9.2.2(4). 

. Prove Lemma 9.2.2(5). 

. Prove Lemma 9.2.9(2). 

. Prove Lemma 9.2.9(3). 


aubBwWN 


7. Let @ and § be ordinals. Show that 
(2) Na tg = Nnax(a,6): () No: Ne = Nnax(a,6): (c) 
No + No = Na No > Na = No: 


8. (AC) Let A and \ be cardinals. Assume that 2 ERE 9A and \ is infinite. 
Prove that KA _— 9r : 

9. (AC) Let @ be an ordinal. Prove that \, . E Da , 

10. Let @ be an ordinal. Prove that if G € Ny+4, then |G] € Ny. 


* 11. Prove Theorem 9.2.13. 
12. (AC) Let ,4 be aset and let & be an infinite cardinal. Suppose that |.A| € K and 


|X| € K forall X € A . Prove that IU Al EK. 
13. (AC) Let 4 be a set and let ( be an infinite regular cardinal. Suppose that 


|.A| € @ and |X | E€ @ forall ¥ € A. 


(a) Prove that sup{|X|:X € A} €@. 
(b) Prove that IU Al EO. 


“14, (AC) Suppose that ): A —+ J isa function. Prove that | h| A]| E 
15. (AC) Show that 2%o “2 Naw for any ordinal a . 
16. (AC) Show that there is a class function ]- Qy, —+ On such that 


i 10) = %. Bi Ayr) = 22, 3. 
-(8) = sup{(a) ae 8} when 3 isa limit ordinal. 


Al. 


17. Prove that }: Ory —s On isstrictly increasing, where —) is as in Exercise 16. 
18. Let J. On — On beas in Exercise 16. Prove the following: 
(a) For each ordinal ‘Y, there is an ordinal § such that yy € 9 and (8 ) = 6. 


(b) If pe 8) = 8, then § isa limit ordinal and ‘ =(a) 3 foala€é 8. 
ey} 


“19. (AC) Let {A; :i€ 7} and {B; : i € I} be two indexed sets. Suppose 
that |A;| € |B;|, for each j € J . By the axiom of choice, let p;: A; + B, be 
an injection for each j € /. 


(a) Show that |p,{A,|| € |B;| and conclude that B; \ p;|A;| 4 @ for all 
i 

(b) Get a function ) so that dom(h) = J and h(i) € B; \ p;{A;] for all 
tel. 


(c) For each x € L);-; Ai, define gx € [];-,; Bi (see (7.1) on page 156) 


gx (i) mn if « € A,;; 
by h(i), ifa¢ A,. 


Define F: U,-; Ai > [],<, Bi by F(x) = gy. Prove that F is one-to-one. 


(d) For every j € J, define 7;: [];-, Bi ~ Bj by 7;(g) = g(j). Suppose 
H: ;-, Ai > [],-; Bi is an injection. Show that |7;|H|Aj;|]| € |B;| , for 
each j € ]. Prove that #] is not a bijection. 

(e) Conclude that |\);_; A; Ilicr Bi| and thus, |U,-; Ai| € |T]<7 Bil- 


Exercise Notes: For Exercise 11, use Theorem 9.2.12 and Exercise 10. Also 
review the proof of Theorem 9.1.16. For Exercises 12 and 13(b), one can apply 
Theorem 9.2.12. In our proof of Theorem 9.2.18, we implicitly applied 
Exercise 14. Exercise 16 defines the beth numbers, 1, = 2(A), for each ordinal 
\. The Hebrew letter 4 is called beth. For Exercise 19(d), note that 
Exercise 14 implies that |7;|/|A;]]| € |H|A;]|| . Exercise 19 outlines a proof of 
a theorem due to Konig: If |A;|€|B;| for all je], then 


Wier A;| € ies Bil. 
9.3 Closed Unbounded and Stationary Sets 


We now discuss two concepts that play a prominent role in modern set theory, 
namely, closed unbounded sets and stationary sets. 


Definition 9.3.1. Let K bea limit ordinal and let (* ea kK. Then we say that 


(1) ( is unbounded in K when (* is cofinal in h (see page 213). 
(2) © is closed in &K when for all YER, if CN4 f- @, then 


sup(CNny) EC. 
(3) ( isclubin & when (” is closed and unbounded in & . 


Let « be a limit ordinal, and let C={G8€K:a€Q8} where aEx. 
Clearly © is unbounded in k. If YER and CNy+@, then 
a €sup(Cny) € x. Thus, sup(CNy) € C and © is club in x. On the 
other hand, if w € &, then the set D = {y € kK: y is a successor ordinal} is 
unbounded in & ; however, - is not closed in k . To see this, recall that every 
nonzero natural number is a successor ordinal (Theorem 4.1.6). So 
DNw={n€w:0€n} and sup(D Nw) = w, which is not in D , because 
w isa limit ordinal. Thus, / is not closedin k. 


Definition 9.3.2. Let K be a limit ordinal and let A C 4. We shall say that XY isa 


bounded subset of 4 if and only if ¥ C AM ‘y forsome y € K. 


Lemma 9.3.3. Let « be a limit ordinal and let C C x . Then @ is closed in k 
if and only if * € @ whenever X is a nonempty bounded subset of (’. 


Proof. Let & bea limit ordinal and let C C x. 


(=). Assume C is closedin kK. Let X # @ beso that X C C/M7y for some 
YER. 


CASE 1: X is a limit set. Thus, by Lemma 8.2.12, 5 = sup X is a limit ordinal 
and ¥ is a cofinal in §. Moreover, 6€ yEuR. AS X Cd, we see that 
X CCM6. Therefore, ©@msé is also a cofinal subset of §. Hence, 
sup(C 15) = sup 6 = 5, by Lemma 8.2.14 and Exercise 10 on page 185. Since 
C is closed in « , we conclude that § € C. Thus, sup ¥ EC. 


CASE 2: X is anota limit set. Therefore, X must have a largest element § . So 
5 = sup X and § € X . Since X C C,, wehave that sup ¥ €C. 


(<=). Assume that sup Y € C whenever X is a nonempty bounded subset of C 
. Let y © & be such that C'M-~ is nonempty. As C7 is clearly a bounded 
subset of (’, the assumption implies that sup(C M7) € C’. Hence, © is closed 
in«.U 


Theorem 9.3.4. Let & bea limit ordinal. Let 7 be a nonempty set such that C 
isclosedin. ,forall .Then is also closed in 


Proof. Let be nonempty and bounded. Because ,weseethat for each 
. Since every is closed in , Lemma 9.3.3 implies that , for all 
Therefore, . Thus, by Lemma 9.3.3, we conclude that isclosedin .U 


Before continuing our discussion on club sets, we will focus our attention on 
indexed functions (see Section 3.3.3) whose domain is an ordinal. 


Definition 9.3.5. Whenever is an ordinal, an indexed function shall be calleda_ - 


sequence, or a sequence when the ordinal is understood. 


Soif isa  -sequence, then isa function with domain .Thus,if , 
then ,and . Moreover, if ,then is the empty sequence . We now 
discuss how one can define a sequence by ordinal recursion. Let _ be an ordinal 
and let be a formula that is functional (see page 150). Theorem 8.2.2 thereby 
implies that there isa -sequence _ satisfying 


Such a sequence is said to be defined by ordinal recursion. Let us consider an 
example. Suppose that isa  ~-sequence of ordinals. We can defineanew_ - 
sequence __ of ordinals by the ordinal recursion 


(9.7) 


for all . Thus, (9.7) implies that for all , and it also implies that the 
sequence _ is strictly increasing; that is, | whenever 


In this section, we shall be defining sequences by recursion, and we will then 
use these sequences to prove theorems about club sets in a regular uncountable 
cardinal. We must first make a useful observation. Suppose that is a regular 
uncountable cardinal and that is a sequence such that and_ for all 
Recalling Definition 9.1.14, it follows that there isan. suchthat  . Thus, the 
set isboundedin and 


Lemma 9.3.6. Let | be a regular uncountable cardinal and let __ be closed in 
. Suppose that isasequence where and_ forall .Then 


Proof. Let , , and be as in the statement of the lemma. Hence, for 
each. Since is regular and __, there is an ordinal such that 
Therefore, is abounded subset of . Lemma 9.3.3 implies that . LI 


Let be a regular uncountable cardinal. In our next theorem, we will prove 
that the intersection of two club sets is also a club set. Later we will prove that 
the intersection of fewer than _club sets is also a club set. 


Theorem 9.3.7. Suppose that —_is a regular uncountable cardinal. If = and 
areclubin ,then  isclubin 


Proof. Let. bearegular uncountable cardinal. Suppose that) and are club 
sets in| . Theorem 9.3.4 implies that is closed in  .To show that is 
unbounded in’ , let . We will show that there is an ordinal — such that 
Since isunboundedin ,thereisanordinal suchthat .Because and 
are both unbounded in _, there exists a recursively defined sequence _ of 


ordinals such that andforall , 


(9.8) 


Thus,( ) forall .Let  .Since is a regular uncountable cardinal, we 
have that. . Observe that 


(9.9) 


From (_), it follows that is alimit set in which and are both cofinal. 
Lemma 8.2.14 implies that .Since and  areboundedin  , Lemma 9.3.3 
and (9.9)imply that and .So .U 


Before proving our next lemma, we make another useful observation. Let 
be a regular cardinal and let .If  isasequence suchthat forall , then 
forany thatisunboundedin , 


(9.10) 


The property (9.10) will be implicitly used in the proof of the next lemma. 


Definition 9.3.8. A) -sequence of sets is said to be nested if and only if, | whenever 


Thus, if is a nested sequence of sets, we have that 


where .Of course, means that 


Lemma 9.3.9. Let be a regular uncountable cardinal. If is a nested 
sequence of clubsetsin. and ,then  isclubin 


Proof. Let be a regular uncountable cardinal. Assume that is a nested 
sequence of club sets in’ where’ . Theorem 9.3.4 implies that — is closed in 

, for all nonzero. . We prove that is unbounded in. Suppose, to the 
contrary, that boundedin_ . Let 


(9.11) 
So ,by Theorem 9.3.7. Hence, _ is either is a successor or a limit ordinal. 


CaseE1: foranordinal .Hence,( ) .Since  , (9.11) implies that is 
unbounded in . Thus, (_ ) and Theorem 9.3.7 imply that is unbounded, 
which contradicts (9.11). 


CaAsE2: isalimit ordinal. Since boundedin , there existsa such that ( 
) . Theorem 8.2.2 implies that there is a sequence of ordinals —_ such that 
forall, 


(9.12) 


It follows from (9.12) that 


(1) whenever _, (2) is a limit set, (3) iscofinalin ,forall , (4) 
forall ,because is nested. 


As and , Lemma 9.3.6 and (1) imply that . Items (2)-(3) and 
Lemma 8.2.14 now yield the equation , foreach . Since , we conclude 
from Lemma 9.3.3 and (4) that , forall .Therefore, . This contradicts ( 

), because’. LJ 


We can now generalize Theorem 9.3.7. 


Theorem 9.3.10. Let be a regular uncountable cardinal and let .If isa 
sequence of club setsin. ,then  isclubin 


Proof. Let , ,and _ be as stated. Theorem 9.3.4 thus implies that is 
closed in for every nonzero . Suppose, for a contradiction, that is 
bounded in’ . Let be the least such that and( ) _ is bounded in 
Theorem 9.3.7 implies that is a limit ordinal (see case 1 of the above proof). 
Hence, forall ,theset isclubin’ .So __ isanested sequence of club sets 
(see Exercise 5). Since , Lemma 9.3.9 implies that is unbounded in _, 
contradicting ( ). LJ 


Theorem 9.3.10 may not hold fora  -sequence of club sets in. To verify 
this, let be such that where isa limit ordinal. Every isclubin_ ; 
however, is notclubin' because’ , which is bounded in. On the other 
hand, when is a regular uncountable cardinal, we will soon show that the 
“diagonal intersection” of a  -sequence of club in sets is also a club set. 
First, in the interest of notational correctness, we define 


Definition 9.3.11. Let be a sequence of subsets of a limit ordinal, . Then is the 


diagonal intersection of the sequence. 


Lemma 9.3.12. Let bealimit ordinal. If is asequence of closed sets in  , 
then the diagonal intersection __ is closed in 


Proof. Let and __ beas stated in the lemma. We will prove that the set _is 
closed in. .Let  beabounded subset of .Thus,( ) forsome_ . If 

is not a limit set, then (see case 2 in the proof of Lemma 9.3.3). If is a limit 
set, then let .By( ), we have . Lemma 8.2.12 implies that is a limit 
ordinal and that is cofinalin .Wenow show that _, that is, we prove the 
following claim. 


Claim. 


Proof. Let .Wewill prove that .Let .It follows that is cofinal in 
Hence, by Lemma 8.2.14. Because ,wesee that is boundedin’ .We 
now show that .Let .So  bythedefinitionof .Since _, it follows that 

. Hence, and thus, . As is a nonempty bounded subset of  , 
Lemma 9.3.3 implies that . Therefore, .(Claim) UO 


Since , Lemma 9.3.3 now implies that isclosedin .U 


Lemma 9.3.13. Let be a regular uncountable cardinal. If is a nested 
sequence of club setsin ,then is unbounded in 


Proof. Let be a nested sequence of club sets in’ , where is a regular 
uncountable cardinal. We shall prove that is unbounded in. _Let 
Recursively define the sequence _as follows: Let. besuchthat  . Then for 
all} | 


(9.13) 


It follows from (9.13) that 


(1) whenever _, (2) is a limit set, (3) iscofinalin ,forall , (4) 
forall ,because is nested. 


Let . Lemma 8.2.12 and (1)-(2) imply that andthat isa limit ordinal. As 

and is regular, we have . We will prove that . Items (2)-(3) and 
Lemma 8.2.14 imply that , for each . Since’ , we conclude from 
Lemma 9.3.3 and (4) that , foreach .Because  iscofinalin  and_ is 
nested, we have that for every . Therefore, .So__, and thus, is 
unbounded. L1 


Theorem 9.3.14. Let be a regular uncountable cardinal. If isa - 
sequence of club setsin ,then  isclubin 


Proof. Let and __ be as stated in the theorem. Thus, Lemma 9.3.12 implies 
that the set is closed in| . For each , let. , which is club in by 
Theorem 9.3.10. Hence, is a nested sequence of club sets. Because (see 
Exercise 5) 


Lemma 9.3.13 implies that isunboundedin . LI 


The Club Filter 


Since the intersection of two club sets is again a club set, the club sets generate a 
natural filter. 


Definition 9.3.15. Let be regular uncountable cardinal. Define by 


The set is called the club filter on 


Using Theorem 9.3.7, one can easily verify that is a filter (see Exercise 8). 


Theorem 9.3.16 (AC). Let — be the club filter on , a regular uncountable 
cardinal. Then is.  -complete; that is, whenever isa -sequence of sets 
in suchthat  , then 


Proof. Let be the club filter on =, a regular uncountable cardinal. Suppose 
that isa  ~-sequence of setsin where. Definition 9.3.15 and the axiom 
of choice imply that there isa -sequence ofclubsetsin suchthat , for 
all . Theorem 9.3.10 asserts that isclubin .Since  , we thus conclude 


that by Definition 9.3.15. LJ 


The above proof can be modified so as to show that a club filter is also closed 
under diagonal intersection. 


Theorem 9.3.17 (AC). Let — be the club filter on , a regular uncountable 
cardinal. Then is normal; that is, whenever isa  ~-sequence of setsin  , 
then 


Proof. See Exercise 9. LJ 


Stationary Sets 


The definition of a stationary set is due to Gérard Bloch (1953). Stationary sets 
permeate combinatorial set theory. 


Definition 9.3.18. Let be a regular uncountable cardinal and let . Then 


stationary in ifandonlyif) for every club in set 


Asubset of _ is stationary if and only if has elements in common with 
every closed and unbounded subset of .Forany ,theset is club in 
Hence, there is an such that whenever is stationary in . Soa 
Stationary set is unbounded in .Alsonotethatif isstationaryin , ,and 

,then and_ arealsostationaryin (see Exercise 2). 


Theorem 9.3.7 implies that every club set is stationary in ; however, there 
are stationary sets that are not club in’ . For example, one can easily show that 
the set isstationaryin .Yet  isnotclosed, because and  .Thus, is 
a Stationary set that is not a club. One can prove (using AC) that there exist two 
Stationary sets that are disjoint. It thus follows that the club filter is not an 
ultrafilter (see Exercise 18). 


Our next lemma makes a simple observation concerning Definition 9.3.18. 


Lemma 9.3.19. Let. be aregular uncountable cardinal. Let. .Then is not 
stationary in ifandonlyifthereisaclubin set — such that 


The following fundamental theorem was proved by Géza Fodor in 1956. Let 

be a regular uncountable cardinal and let . A function is said to be 

regressive on if forall . Fodor proved thatifafunction _ is regressive 
ona stationary subset of ,then is thereby constant on a stationary set. 


Theorem 9.3.20 (AC). Let be a regular uncountable cardinal. Suppose that a 
function _is regressive on a stationary set. Then there exists an _ such that 
is stationary. 


Proof. Let be a regular uncountable cardinal and let . Suppose that is 
regressive on a Stationary set. Hence, is stationary. Assume, for a 
contradiction, that for all the set — is not stationary. Lemma 9.3.19 and the 
axiom of choice imply that there isa -sequence of club sets such that (__) 
, for all . By Theorem 9.3.14, the diagonal intersection is club in 
Since is stationary and is a club, there exists an. suchthat  . Thus, (i) 
and (ii) .So( )and(ii)implythat forall , which contradicts (i). UJ 


Because every set that is stationary in is alsounboundedin _, we have our 
next corollary. This corollary will then be followed by a proof of its converse. 


Corollary 9.3.21 (AC). Let — be a regular uncountable cardinal. Suppose that 
is stationary in’ .Thenforall ,if isregressive on _, then there exists 
an suchthat is unbounded in 


Theorem 9.3.22. Let be a regular uncountable cardinal and. Suppose that 


for all, if is regressive on _, then there exists an such that is 
unbounded in .Then is stationary in 


Proof. Let} and  beas stated in the theorem. Assume that forall ,if is 
regressive on __, then there exists an such that is unbounded in 
Suppose, for a contradiction, that is not stationary. Thus, by Lemma 9.3.19, 
there isclubin' set sothat .Define by .Clearly, forall .Since 
is closed and, it follows that forall .Hence, _ is regressive on 
By our assumption, there is an’ suchthat is unboundedin . Since is 
unbounded, let be suchthat . Because is also unbounded, there is a 
such that .So  and_. Thus, by the definitionof ,wehavethat . Since 
, we also have that . This contradiction completes the proof. L 


The following definition will be used in some of the exercises. 


Definition 9.3.23. Let be a regular uncountable cardinal andlet| . Then is said to 


be normal if the following two conditions hold: 


(1) whenever (| is strictly increasing), (2) for every limit ordinal 
is continuous). 


Exercises 9.3 


1. Let and be closed sets in| 4 a limit ordinal. Suppose that and. Let 
be nonempty. Show thatif, , then 


2. Let be a regular uncountable cardinal and let| . Show that if is club in 
, then is club in . Show that if is stationary in , then is 
stationary in 
3. Let where is a regular uncountable cardinal. Show that is club in 
4. Let where isaclubsetin) 4 aregular uncountable cardinal. Show that 
is club in 


5. Let be a limit ordinal and let be any sequence of sets. Foreach _, let 


(a) Prove that is a nested sequence of sets. 
(b) Prove that, for all 
(c) Prove that 


6. Let be such that where is a limit ordinal. Evaluate 
7. Let be a limit ordinal and let be a sequence of subsets of _—_. Prove that 


8. Let be regular uncountable cardinal and let be the club filter on 
Prove the following items: 

(a) and 

(b) If and, then 

(c) If and =, then 


9. Prove Theorem 9.3.17. 


10. Let be the club filter on —, a regular uncountable cardinal. Suppose that isa 
-sequence of sets in where __. Explicitly explain how the axiom of choice (page 66) 

implies that there exists a -Sequence of club sets in such that, for all 

11. Let be a regular uncountable cardinal and let —. Prove that if is normal, then 

the set is club in 

12. Let be normal, where is regular uncountable cardinal. Prove that is club in 

13. Let be club in|, a regular uncountable cardinal. Let be defined by the 


following recursion (see Theorem 8.2.2): Forall| , 


Prove that is normal, and that 


14. Let be aregular uncountable cardinal and let +. Prove that the set __is 
club in 

15. Let bearegular uncountable cardinal. Suppose that __ is stationary in 
and isclubin’ .Provethat _ is stationary. 

16. (AC) Suppose that is a regular uncountable cardinal and that 


(a) Let bea _ -sequence of subsets of . Suppose that the set is 
Stationary in. .Provethat _ is stationary, for some 

(b) Let . Prove that is constant on a stationary set; that is, show that 
there isan  suchthat is stationary in 


17. (AC) Let — be a regular uncountable cardinal. Suppose that isa_ - 
sequence of subsets of .Provethatif is stationary, then __ is stationary, for 


some 
18. Let — be the club filter on, a regular uncountable cardinal. Suppose 

that and  arebothstationaryin’ .Provethat is not an ultrafilter. 

19. (AC) Let be the club filter on , a regular uncountable cardinal. 

Suppose that is stationary in. Prove that there exists an ultrafilter on 

such that and 


Notes 


Preface 
1. In naive set theory, a set is any well-defined collection of objects. 
Chapter 1 


1. In mathematics, “or” always means the inclusive “or,” that is, we allow for 
both conditions to hold (see page 7). Thus, if « € A andzeée B,thenzé€ AUB. 


2. In axiom 3 (8), the variable § is chosen so that it does not appear in the 
formula ¥ (2 ). 


Chapter 2 


1. When proving X = Y, we shall say that X is the first set and y is the second 
set. 


2. Recall that the elements of a set are also sets. 

3. The arrow = shall be used to abbreviate the word “implies.” 
Chapter 3 

1. For posets, we shall write (A, <) for the ordered pair (A, =) . 


Chapter 4 


1. The label (TH) will often be used to identify the induction hypothesis. 


2. The material covered in this subsection will not be used elsewhere in the 
book. 


Chapter 8 

1. A parameter is an unassigned set that may occur in a formula (see page 22). 
2. In (8.14), replace a with 0 and replace f(y)* with f(y)+a. 

3. In (8.14), replace a with 1 and replace f(+)* with f(y)-a. 

Chapter 9 

1. See Remark 8.1.8 and Definition 8.1.9 on page 179. 

2. See Remark 8.2.18 on page 191. 


3. §& (aleph) is the first letter in the Hebrew alphabet. 


References 


[1] Cunningham, Daniel W. 2012. A Logical Introduction to Proof. New York: 
Springer. 


[2] Dauben, J.W. 1990. Georg Cantor: His Mathematics and Philosophy of the 
Infinite. History of science. Princeton: Princeton University Press. 


[3] Dedekind, Richard. 2012. Was sind und was sollen die Zahlen? Cambridge: 
Cambridge University Press. Reprint of the 1893 edition. 


[4] Ebbinghaus, Heinz-Dieter. 2007. Ernst Zermelo. An Approach to His Life 
and Work. Berlin: Springer. In cooperation with Volker Peckhaus. 


[5] Enderton, Herbert B. 1977. Elements of Set Theory. New York: Academic 
Press. 


[6] Enderton, Herbert B. 2001. A Mathematical Introduction to Logic. 2nd edn. 
Burlington, MA: Harcourt/Academic Press. 


[7] Halmos, Paul R. 1974. Naive Set Theory. New York: Springer. Reprint of the 
1960 edition published by Van Nostrand. 


[8] Kanamori, Akihiro. 2012. In praise of replacement. Bulletin of Symbolic 
Logic, 18(1), 46-90. 


[9] Kunen, Kenneth. 2009. The Foundations of Mathematics. Studies in Logic, 
vol. 19. London: College Publications. 


[10] Levy, Azriel. 2002. Basic Set Theory. Mineola, NY: Dover Publications. 
Reprint of the 1979 original published by Springer. 


[11] Moore, Gregory H. 2012. Zermelo’s Axiom of Choice: Its Origins, 
Development, and Influence. Mineola, NY: Dover Publications. Reprint of 
the 1982 original published by Springer. 


[12] Moschovakis, Yiannis. 2006. Notes on Set Theory. 2nd edn. Undergraduate 
Texts in Mathematics. New York: Springer. 


[13] Peano, Giuseppe. 1889. Arithmetices Principia: Nova Methodo Exposita. 
Turin: Fratres Bocca. 


[14] Potter, Michael. 2004. Set Theory and Its Philosophy. New York: Oxford 
University Press. 


Index of Special Symbols 


M FR. 
= > 
= 


wy ~~ wy 


INNNSAGOZQSs & 


~ 
wy y 


— 
=) 


» [a,b], 3 


” 
~ 


CWyWLnNnNNNNNDN 


—_ 
& 


- 
J . 


~3,4,6 


{ >2vB 


~ 


<i 
LU © 
— 
KR 


’ 


Van € s 
=, 19 
{uu} , 32 
{u} , 32 
JF, a2 
AF, 33 

7 a be 
(x,y) ,41 
Ax B,43 
dom(R), 45 
fld(R), 45 
ran(R), 45 
RIA], 46 
RoS,46 
RA, 46 


- 
= 
ae 


, (Sa € A), 15 


R-}, 46 
R~1|B), 46 
aRb , 49 
a~b,49 
[a], 51 
[a], 52 

F: A> B,58 
F(a), 58 

H+, 08 
F[A], 60 
F}A,60 
F-1, 60 

Wy, 59 
Ole, 59 
{F(x):x€ A} , 60 
AB,59 
FoG, 60 
F~1{B], 60 
ran(F’), 60 

(Fj :iE€ I) ,64 
Nicr Fi, 65 
Uier Fi» 65 

{ F; :tET} , 65 
(A, x), 70 

<, 70 

Xo, 74 

0,77 
f(x,y), 78 
at, 83 

Ww, 84 
m+n, 96 
mm? ,97 

m”, 101 

€, 103 
max(m,7n), 106 
e107 


<, 107 

|A| , 112 

ew ,121 
WEP 125 

|A| =e [BI , 129 
|A] <. |B] , 132 
|A] <c |B], 133 
s(u), 144 

s(u), 144 
s*(u), 143 
s<(u), 143 
ews , 146 

<W,4 , 146 

<Wa , 146 

A, 154 

Hier B;, 156 
Cp, 158 

<1, 172 

at, 176 
€,175 

ran(x), 176 

=, 178 

inf C, 182 
max(a, 8), 182 
sup C, 182 
On, 186 

f: X — On, 191 
a+ 8,195 

a 8,196 

a8, 199 

Va, , 202 

V, 204 
rank(A), 205 
|A| , 210 

N(8), 213 


Ng, 213 
NH, 213 
Ny, 213 
cf(y), 213 
X,, 214 
6+, 219 
6-K,219 
oh, S51 
(8) , 226 
le. eee 


Index 


absorption law, 220 
aleph ( ) 
function, 212 
numbers, 212 
aleph-naught, 213 
aleph-one, 213 
algebraic number, 124 
arrow diagram, 58 
Artin, Amil, 157 
associative laws 
for sets, 37 
logical, 11 
atomic formula, 21 
axiom of choice (AC), 66, 156 
axiom of dependent choices (DC), 165 
axiom of foundation, 204 
axiom schema, 25 
axioms of set theory, 23—26, see individual axioms 


beth numbers, 227 

bijection, see function 

binary operation, 78, 96 

bound variable, 14 

bounded quantifier negation laws, 16 
bounded set quantifiers, 15 

bounded subset, 227 

Burali—Forti theorem, 183 


cancellation laws 

on the natural numbers, 106 

on the ordinals, 198 
Cantor, Georg, x, 1, 20, 110, 124, 208 
Cantor’s diagonal argument, 124, 129 


cardinal, 208 
arithmetic , 218-225 
exponentiation, 221 
number, 208 
power, 221 
product, 219 
regular, 213 
singular, 213 
sum, 219 
cardinality, 129 
Cartesian product, 43 
chain, 74 
choice function, 66, 67, 156 
class, 31 
class function, 150, 187 
closed, 227 
closed under finite intersections, 167 
club, 227 
filter, 233 
codomain, 57 
cofinal subset 
of a chain, 158 
of a limit ordinal, 213 
of a limit set, 190 
of a well-ordered set, 215 
cofinality, 213 
cofinite subset, 166 
Cohen, Paul, 67, 133, 138 
commutative laws 
logical, 11 
comparability theorem, 161, 218 
comparable, 73 
compound sentence, 6, 7 
comprehension principle, 20 
congruent, 77, 79 
continuation, 172 
continuous, 189 
continuum hypothesis, 133, 137, 223 
cardinal identity, 224 


generalized, 137 
cardinal identity, 224 

contradiction 

law, 11 

logical, 9 
contrapositive law, 11 
countable axiom of choice, 157 
countable set, 117 
countably infinite, 117 
cumulative hierarchy, 202 


De Morgan, Augustus, 9 
De Morgan’s Laws 

for sets, 38 

logical, 10 
decimal expansion, 127 
Dedekind, Richard, 91, 149 
Dedekind-infinite, 149 
definition 

by recursion, 89 

by transfinite recursion, 150 
denumerable, 119 
dependent choices, 165 
derived equivalence relation, 80 
diagonal argument, 124, 129, 224 
diagonal intersection, 232 
difference, see set operations 
disjoint sets, 2 
distributive laws 

for quantifiers, 19 

for sets, 37, 39 

logical, 11 
divisibility relation, 71 
division algorithm, 109 
DML (De Morgan’s Law), 10 
DNL (double negation law), 11 
domain, 45, 57 
dual filter, 169 
dual ideal, 169 


element, 1 
empty function, 59 
empty set, 2 
empty set axiom, 24, 29 
enumeration, 119 
equality, 2, 28 
equivalence class, 51 
equivalence relation, 50 
derived, 80 
existential statement, 14 
extensionality axiom, 24, 28 


field, 45 
filter, 165 
generated, 165 
maximal, 166 
principal, 165 
finite intersection property, 167 
finite set, 112 
number of elements, 112 
fixed-point theorem, 192 
Fodor, Géza, 235 
formula (of set theory), 21 
with parameters, 22 
Fraenkel, Abraham, 20 
free variable, 14, 22, 24 
function, 57 
bijection, 63 
codomain, 57 
composition, 60 
domain, 57 
empty, 59 
image, 60 
injection, 62 
inverse, 60 
inverse image, 60 
one-to-one, 62 
onto, 63 
range, 60 


restriction, 60 

surjection, 63 
function from 4 to 8,57 
functional, 150, 151 
fundamental theorem 

of equivalence relations, 53 


Gédel, Kurt, 67, 133, 138 

generalized continuum hypothesis, see continuum hypothesis 
greatest element, 73 

greatest lower bound, 72 


Hartogs, Friedrich, 210 

Hartogs’ theorem, 210 

Hausdorff maximal principle, 165 
hereditarily finite sets, 207 
Hilbert, David, 111 


ideal, 168 
generated, 169 
idempotent laws, 11 
identity relation, 44 
iff, 28 
IH, induction hypothesis, 189 
increasing function, 108 
index, 65 
index set, 65 
indexed 
family, 65 
function, 65 
set, 65 
induced relation, 74 
induction on w 
principle of mathematical induction, 85 
strong induction principle, 107 
induction postulate, 92 
inductive set, 84 
infimum, 182 
infinite set, 112 


infinity axiom, 24, 84 

initial segments, 143 
injection, see function 
integers, set of, 2 

intersection, see set operations 
interval, 3 

irrational numbers, 128 
isomorphic posets, 74, 178 
isomorphism, 74, 178 


Konig, Julius, 157 
Konig’s theorem, 224, 227 
kK -complete, 234 
Kuratowski, Kazimierz, 41 


ladder, 159 
larger, 72 
largest element, 73 
least element, 73 
least upper bound, 72 
less than on w, 103 
less than or equal on w , 103 
lexicographic ordering, 77, 174 
limit ordinal, 182 
limit set, 190 
logic law, 9 
logical equivalence, 9 
not equivalent, 10 
lower bound, 72 


Marczewski, Edward, 162 
mathematical induction, 85 
maximal element, 72 
maximal filter, 166 
maximal ideal, 169 
member, 1 

minimal element, 72 


naive set theory, x, 20 


natural number 
addition, 96 
arithmetic , 95-101 
definition, 84 
exponentiation, 101 
multiplication, 97 
recursion, 88 
successor, 96 
natural numbers, set of, 2 
definition, 84 
principle of mathematical induction, 85 
strong induction principle, 107 
trichotomy law, 105 
well-ordering principle, 107 
negation 
double negation law, 11 
nested, 230 
nonempty terms, 65, 156 
normal, 189, 236 
normal filter, 234 
null set, 2 


one-to-one, see function 
one-to-one correspondence, 110 
onto, see function 
order, 70 

partial order, 70 

preorder, 79 

strict order, 71 

total order, 71 

total preorder, 81 

well-ordering, 107, 142 
order type, 179 
order-extension principle, 162, 163 
ordered pair, 41 

first component, 41 

second component, 41 
ordered triple, 43, 92 
ordinal, 175 


addition, 195 
arithmetic , 195-199 
exponentiation, 199 
induction, 183 
limit, 182 
multiplication, 196 
recursion, 186, 187 
successor, 182 
ordinal class function, 187 
ordinal-valued set function, 191 


pair set, 32 
pairing axiom, 24, 31 
pairwise disjoint, 51 
parameter, 22, 24, 25, 153, 186 
partial order, 70 
partially ordered set, 70 
partition, 51 
Peano, Giuseppe, 91 
Peano system, 92 
pigeonhole principle, 114 
dual version, 114 
Poincaré, Henri, 111 
poset, 70 
power set, 3, 35, 133 
cardinality of, 133 
power set axiom, 24, 34 
predicate, 13 
preorder, 79 
preordered set, 79 
prime ideal, 169 
principal filter, 165 
principle of mathematical induction, 85, 107 
product set, 156 
proper class, 31 
proper subset, 2 
properties of inequality on w , 106 
proposition, 6 
propositional components, 6 


propositional logic laws, 10 
propositional sentence, 9 
proset, 79 


QDL (quantifier distribution laws) 
existential laws, 19 
universal laws, 19 
QIL (quantifier interchange laws), 18 
QNL (quantifier negation laws), 15, 16 
quantifiers, 14 
adjacent, 16 
existential, 14 
mixed, 17 
set bounded, 15 
universal, 14 
quotient set, 53 


range, 45 
rank, 205 
rational numbers, set of, 2 
real numbers, set of, 2 
recursion 
on w , 88 
on ordinals, 186, 187 
transfinite, 147, 150 
regressive function, 235 
regular cardinal, 213 
regularity axiom, 24, 204 
relation, 43 
antisymmetric, 55, 70 
composition, 46 
domain, 45 
equivalence, 50 
field, 45, 46 
image, 46 
inverse image, 46 
range, 45 
reflexive, 50 
restriction, 46 


single-rooted, 48 
single-valued, 57 
symmetric, 50 
transitive, 50 
relation from 4 to 8, 44 
relation on A , 44 
replacement axiom, 24, 151, 187, 202, 204, 211 
Russell, Bertrand, 20, 67 
Russell’s paradox, 20, 25 


Schréder—Bernstein Theorem, 135 
separation, 3, 25 
sequence, 228 
set, 1 
set equality 
definition, 2 
proof strategy 
double-subset strategy, 28 
iff strategy, 29 
set function, 146 
set operations 
difference, 4 
intersection 
of a set, 33 
of indexed sets, 65 
of two sets, 4 
union 
of a set, 32 
of indexed sets, 65 
of two sets, 4 
singleton, 32 
singular cardinal, 213 
smaller, 72 
smallest element, 73 
stationary set, 234 
strict order, 71 
strictly increasing, 189 
strong induction principle on w , 107 
subset axiom, 24, 29 


subset relation 
definition, 2 

successor, 83 

successor ordinal, 182 

suitable, 89 

supremum, 182 

surjection, see function 

symmetric difference, 37 


tautology 
law, 11 
logical, 8 
term, 65 
total order, 71 
total preorder, 81 
totally ordered set, 71 
transcendental number, 129 
transfinite, 144 
transfinite induction, 184 
transfinite induction principle, 144 
transfinite recursion theorem 
class form, 150 
function form, 147 
ordinal class version, 187 
ordinal version, 186 
transitive closure, 154 
transitive set, 86 
trichotomy law, 72 
on w, 105 
triple, 92 
truth set, 3 
truth table, 6 


ultrafilter, 166 

theorem, 168 
unbounded, 227 
uncountable set, 124 
union, see set operations 
union axiom, 24, 32 


universal statement, 14 

universe of discourse, 13 

universe of sets, 205 

upper bound, 72, 158 
proper, 158 


vacuous truth, 7, 86, 104 
variable, 13, 21 
bound, 14 
free, 14 
Veblen, Oswald, 192 
Venn diagram, 2 
von Neumann, John, 208 


well-ordered structure, 178 

well-ordering, 142 

well-ordering principle on w , 107 
well-ordering theorem, 171, 173, 188, 194 
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Zermelo—Fraenkel axioms, 23-24 
ZF, 67 

ZFC, 67 

Zorn, Max, 157 
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